DNA encoding triol polyketide synthase

ABSTRACT

DNA encoding triol polyketide synthase (TPKS) from Aspergillus terreus has been isolated, purified and sequenced. Expression vectors comprising said DNA, cells transformed with the expression vectors, and processes employing the transformed cells are provided.

CROSS-RELATED TO OTHER APPLICATIONS

This is a continuation of U.S. Ser. No. 08/148,132 filed. Nov. 2, 1993, a 371 of PCT/US94/12423 filed Oct. 28, 1994, which is now abandoned.

BACKGROUND OF THE INVENTION

Hyperchlosterolemia is known to be one of the prime risk factors for ischemic cardiovascular diseases such as arteriosclerosis. Cholesterol and other lipids are transported in body fluids by lipoproteins of varying density. The two lipoproteins carrying the majority of cholesterol in the blood are low-density lipoproteins (LDL) and high-density lipoproteins (HDL). The role of LDL is to transport cholesterol to peripheral cells outside the liver. LDL-receptors on a cell plasma membrane bind LDL and allow entry of cholesterol into the cell. HDL may scavenge cholesterol in the tissues for transport to the liver and eventual catabolism. LDL levels are positively correlated with the risk of coronary artery disease while HDL levels are negatively related, and the ratio of LDL-cholesterol to HDL-cholesterol has been reported to be the best predictor of coronary artery disease. Thus substances which effectuate mechanisms for lowering LDL-cholesterol may serve as effective antihypercholesterolemic agents.

Mevacor® (lovastatin; mevinolin) and ZOCOR® (simvastatin), now commercially available, are two of a group of very active antihypercholesterolemic agents that function by inhibiting the enzyme HMG-CoA reductase. Lovastatin and related compounds inhibit cholesterol synthesis by inhibiting the rate-limiting step in cellular cholesterol biosynthesis, namely the conversion of hydroxymethyl-glutarylcoenzyme A (HMG-CoA) into mevalonate by HMG-CoA reductase 3.7-9.12!. HMG-CoA reductase inhibitors act through cellular homeostatic mechanisms to increase LDL receptors with a consequent reduction in LDL-cholesterol and a resultant therapeutic antihypercholesterolemic effect. The HMG-CoA reductase inhibitors within this invention include, but are not limited to compactin (ML-236B), lovastatin, simvastatin, pravastatin, fluvastatin and mevastatin.

Many HMG-CoA reductase inhibitors are synthesized by microorganisms. The general biosynthetic pathway of the HMG-CoA reductase inhibitors of the present invention has been outlined by Moore et al., who showed that the biosynthesis of mevinolin (lovastatin) by Aspergillus terreus ATCC 20542 proceeds from acetate via a polyketide pathway (R. N. Moore et al., Biosynthesis of the hypocholesterolemic agent mevinolin by Aspergillus terreus. Determination of the origin of carbon, hydrogen, and oxygen atoms by ¹³ C NMR and mass spectrometry. J. Amer. Chem. Soc., 1985, 107: 3694-3701). Endo and his coworkers demonstrated that similar biosynthetic pathways existed in Pencillium citrinum NRRL 8082 and Monascus ruber M-4681 (A. Y. Endo et al., Biosynthesis of ML-236B (compactin) and monacolin K., 1985, J. Antibiot., 38: 444-448).

The recent commercial introduction of HMG-CoA reductase inhibitors has provided a need for high yielding processes for their production. Methods of improving process yield include, but are not limited to scaling up the process, improving the culture medium or, simplifying the isolation train. The present invention focuses on a method of increasing process yield wherein the increase in productivity is due to the use of a microorganism that produces increased levels of HMG-CoA reductase inhibitor.

It may be desirable to increase the biosynthesis of HMG-CoA reductase inhibitors at the level of gene expression. Such increases could be achieved by increasing the concentration in an HMG-CoA reductase inhibitor-producing microorganism of one or more of the enzymes or enzymatic activities in the biosynthetic pathway of the HMG-CoA reductase inhibitor. It may be particularly desirable to increase the concentration of a rate-limiting biosynthetic activity.

Triol polyketide synthase (TPKS) is a multifunctional protein with at least four activities as evidenced by the product of the enzymatic activity (Moore, supra). TPKS is believed to be the rate-limiting enzymatic activity(ies) in the biosynthesis of the HMG-CoA reductase inhibitor compounds.

The present invention identifies a DNA encoding triol polyketide synthase (TPKS) from Aspergillus terreus. The DNA encoding the TPKS of the present invention has been isolated, purified and sequenced. Complementary DNA (cDNA) and genomic DNA sequences corresponding to TPKS have been prepared. The TPKS cDNA of the present invention may be used to increase the production of HMG-CoA reductase inhibitors by HMG-CoA reductase inhibitor-producing microorganisms. The TPKS cDNA of the present invention may also be used to produce purified TPKS.

SUMMARY OF THE INVENTION

DNA encoding the full-length form of triol polyketide synthase (TPKS) is identified. The DNA is sequenced and cloned into expression vectors. Cells transformed with the expression vectors produce increased levels of TPKS and increased levels of HMG-CoA reductase inhibitors. The DNA is useful to produce recombinant full-length TPKS. The DNA may be used to isolate and identify homologues of TPKS present in organisms that are capable of producing polyketides, particularly microorganisms that are capable of producing HMG-CoA reductase inhibitors.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A-1T are the nucleotide sequence of triol polyketide synthase.

FIGS. 2A-2C are the predicted amino acid sequence of triol polyketide synthase.

FIG. 3 shows pTPKS100.

FIG. 4 is a graphic view of the open reading frame of the TPKS protein and the overall placement of the TPKS peptides and PKS activities established by alignments generated by the Intelligenetics GeneWorks program.

FIG. 5 shows the alignments of keto acyl synthase, acetyl/malonyl transferase and dehydratase carried out on regions of TPKS, rat fatty acid synthase (FAS) and P. patulum 6MSAS.

FIG. 6 shows the alignments of enoyl reductase, keto reductase and acyl carrier protein carried out on regions of TPKS.

FIG. 7 is a Chou-Fasman secondary structure prediction of pyridine nucleotide binding regions of TPKS and related proteins.

FIG. 8 shows the S-adenosylmethionine binding regions of a variety of prokaryotic and eukaryotic methyl transferases.

FIG. 9 is a Southern blot showing the homology of ketoacylsynthase of the TPKS of A. terreus to M. ruber and P. citrinum.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a DNA molecule encoding triol polyketide synthase (TPKS) which is isolated from TPKS-producing cells. Cells capable of producing TPKS include, but are not limited to, strains of Aspergillus terreus, Monascus ruber, Penicillum citrinum, Penicillum brevicompactum, Hypomyces chrysospermus, Paecilomyces sp M2016, Eupenicillium sp. MM603, Trichoderma longibrachiatum M6735 and Trichoderma pseudokoningii M6828.

TPKS, as used herein, refers to enzymatic activities that convert acetate precursors and S-adenosyl methionine to an intermediate in the triol biosynthetic pathway. This intermediate is further modified to produce a triol nonaketide. Polyketide synthases from bacteria and fungi employ common enzymatic functions to synthesize polyketides from two carbon units (for a review, see D. A. Hopwood and D. H. Sherman, 1990, "Comparison to fatty acid biosynthesis", Ann. Rev. Genet., 24: 37-66).

Polyketides are an important class of natural products because of their structural diversity and because many have antibiotic or other pharmaceutical activities. Most of the economically important polyketides are produced by fungi or actinomycetes.

Polyketide biosynthesis is similar to that of fatty acid biosynthesis in that it involves the sequential condensation of carboxylate units. Unlike fatty acids, which are built from acetate units, polyketides may be built from acetate, propionate, or butyrate units. Additionally, some or all of the β-keto groups added at each cycle of condensation during polyketide biosynthesis are left unreduced, or are reduced only to hydroxyl or enoyl functionalities. This variation in building units and the variation in modification of the beta-keto groups results in a tremendous variety of products as well as difficulty in comparing biosynthetic genes from different pathways.

Aspergillus terreus is a filamentous soil fungus; different strains of A. terreus produce a variety of polyketides (Springer, J. et al., 1979, terretonin, a toxic compound from Aspergillus terreus, J. Org. Chem., Vol. 44, No. 26, 4852-4854). Lovastatin is a polyketide produced by certain strains of A. terreus (Moore, supra). In addition to lovastatin and related metabolites such as triol or monacolin J, other polyketides found in A. terreus include sulochrin and related structures (Curtis, R. G. et al.,1964, "The biosynthesis of phenols", J. Biochem., 90: 43-51) derived from emodin (Fujii, I., et al., 1982, "Partial purification and some properties of emodin-o-methyltransferase from (+)-geodin producing strain of Aspergillus terreus". Chem. Pharm. Bull., 30(6):2283-2286); terreic acid (Sheehan, J. C. et al., 1958, J. Am. Chem. Soc., 80: 5536); patulin (D. M. Wilson, 1976, "Adv. Chem. Ser. No. 149") and citrinin (Sankawa, U. et al., 1983, "Biosynthesis of citrinin in Aspergillus terreus", Tetrahedron, 39(21): 3583-3591). Presumably each of these products is made by a specific PKS encoded by a specific and distinct PKS gene(s), thus increasing the difficulty in cloning the triol PKS.

The structure and activity of lovastatin was reported by A. Alberts et al., (Proc. Natl. Acad. Sci. U.S.A., 1980, 77: 3957-3961). Lovastatin is a reduced molecule consisting of a methylbutyryl group joined by an ester linkage to a nonaketide having a conjugated decene ring system.

Moore et al., (supra) described lovastatin biosynthesis. Proton and ¹³ C NMR studies of in vivo labeled lovastatin showed that all the carbons are derived from acetate except in the methyl groups at positions 6 and 2', which were derived from methionine. The triol molecule is composed of nine acetate units. The side-chain is composed of two acetate units. Esterification of triol and the butyrate side chain occurs enzymatically (Kimura, supra). The methyl butyrate side chain is presumably synthesized by a separate PKS. Lovastatin is first synthesized as a highly reduced precursor longer than 9 acetate units which undergoes reoxidation, including oxidative cleavage of a carbon-carbon bond.

Limited information is available for compactin biosynthesis. The most likely pathway would be nearly identical to that of lovastatin biosynthesis in M. ruber and A. terreus, except that methylation does not occur at the 6 position on the diene ring system.

Polyketide synthases (PKS) and fatty acid synthases (FAS) are classified by functional types. Type II enzymes, typical of bacteria and plants, have a separate polypeptide for each enzymatic activity. Type I enzymes, found in animals, bacteria and fungi, consist of large polypeptides with multiple activities or functional domains. Regions of amino acid sequence similarity have been identified in these genes: domains for ketoacyl synthase, acetyl/malonyl transferase, β-keto reductase, enoyl reductase, dehydratase and acyl carrier protein. The identification of these domains is considered evidence of the resulting enzymatic activity in light of the difficulty in obtaining functional Type I PKS in vitro (Sherman, supra).

Any of a variety of procedures may be used to molecularly clone the TPKS genomic DNA or complementary DNA (cDNA). These methods include but are not limited to, direct functional expression of the TPKS gene in an appropriate host following the construction of a TPKS-containing genomic DNA or cDNA library in an appropriate expression vector system. The preferred method consists of screening a TPKS-containing cDNA expression library constructed in a bacteriophage or vector with an antibody directed against the purified TPKS protein. The antibody is obtained by standard methods (Deutscher, M. (ed), 1990, Methods in Enzymology, Vol. 182) by isolating purified TPKS protein from HMG-CoA reductase inhibitor-producing cells, inoculating an appropriate host, such as a rabbit, with the purified protein and, after several boosts, collecting immune sera. Antibody collected from the animal is used to screen the cDNA expression library and cDNA clones expressing TPKS epitopes recognized by the antisera are selected. The positive clones are further purified, labeled and used to probe TPKS-containing genomic or cDNA libraries to identify related TPKS containing DNA. Standard restriction analysis of the related clones can be used to create a restriction map of the region and sequence analysis of the genomic and cDNA clones can be used to define a structural map and the open reading frame of the gene, respectively.

Another method of cloning TPKS involves screening a TPKS-containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with a labelled oligonucleotide probe designed from the amino acid sequence of TPKS. The method may consist of screening an TPKS-containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with a partial cDNA encoding the TPKS subunits. This partial cDNA is obtained by the specific PCR amplification of TPKS DNA fragments through the design of degenerate oligonucleotide primers from the amino acid sequence of the purified TPKS subunits.

It is readily apparent to those skilled in the art that other types of libraries, as well as libraries constructed from other cells or cell types, may be useful for isolating TPKS-encoding DNA. Other types of libraries include, but are not limited to, cDNA libraries derived from other cells or cell lines and genomic DNA libraries.

It is readily apparent to those skilled in the art that suitable cDNA libraries may be prepared from cells or cell lines which have TPKS activity. The selection of cells or cell lines for use in preparing a cDNA library to isolate TPKS cDNA may be done by first measuring cell associated TPKS activity using incorporation of radiolabelled acetate and separation of products by high performance liquid chromatography (HPLC).

Preparation of cDNA libraries can be performed by standard techniques well known in the art. Well-known cDNA library construction techniques can be found for example, in Maniatis, T., Fritsch, E. F., Sambrook, J., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1982).

It is also readily apparent to those skilled in the art that DNA encoding TPKS may also be isolated from a suitable genomic DNA library. Construction of genomic DNA libraries can be performed by standard techniques well-known in the art. Well-known genomic DNA library construction techniques can be found in Maniatis et al., (supra).

In order to clone the TPKS gene, knowledge of the amino acid sequence of TPKS may be necessary. To accomplish this, TPKS protein may be purified and partial amino acid sequence determined by conventional methods. Determination of the complete amino acid sequence is not necessary. Once suitable amino acid sequences have been identified, the DNA sequences capable of encoding them are synthesized.

Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and therefore, the amino acid sequence can be encoded by any of a set of similar DNA oligonucleotides. Only one member of the set will be identical to the TPKS sequence but will be capable of hybridizing to TPKS DNA even in the presence of DNA oligonucleotides with mismatches. The mismatched DNA oligonucleotides may still hybridize to the TPKS DNA to permit identification and isolation of TPKS encoding DNA.

It is readily apparent to those skilled in the art that DNA encoding TPKS from a particular organism may be used to isolate and purify homologues of TPKS from other organisms. To accomplish this, the first TPKS DNA may be mixed with a sample containing DNA encoding homologues of TPKS under appropriate hybridization conditions. The hybridized DNA complex may be isolated and the DNA encoding the homologous DNA may be purified therefrom.

cDNA clones encoding TPKS may be isolated in a two-stage approach employing polymerase chain reaction (PCR) based technology and cDNA library screening.

Amino acid sequence information may be obtained by automated amino acid sequencing using Edman chemistry of both the intact protein and the peptide fragments generated by specific proteolytic cleavage. Following incubation for the prescribed periods, digestion is terminated and resulting peptide fragments are fractionated and detected.

TPKS in substantially pure form derived from natural sources according to the purification processes described herein, is found to be encoded by a single mRNA.

The cloned TPKS cDNA obtained through the methods described above may be expressed by cloning it into an expression vector containing a suitable promoter and other appropriate transcription regulatory elements, and transferred into prokaryotic or eukaryotic host cells to produce recombinant TPKS. Techniques for such manipulations are well-known in the art.

In order to simplify the following Examples and the Detailed Description, certain terms will be defined.

Expression vectors are defined herein as DNA sequences that are required for the transcription of cloned copies of genes and the translation of their mRNAs in an appropriate host. Such vectors can be used to express eukaryotic genes in a variety of hosts such as bacteria, bluegreen algae, plant cells, insect cells and animal cells. Expression vectors include, but are not limited to, cloning vectors, modified cloning vectors, specifically designed plasmids or viruses. Specifically designed vectors allow the shuttling of DNA between hosts, such as bacteria-yeast or bacteria-animal cells. An appropriately constructed expression vector should contain: an origin of replication for autonomous replication in host cells, selectable markers, a limited number of useful restriction enzyme sites, a potential for high copy number, and active promoters.

An expression vector is a replicable DNA construct in which a DNA sequence encoding a TPKS is operably linked to suitable control sequences capable of effecting the expression TPKS in a suitable host. Control sequences include a transcriptional promoter, an optional operator sequence to control transcription and sequences which control the termination of transcription and translation.

Certain vectors, such as amplification vectors, do not need expression control domains but rather need the ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants.

A promoter is defined as a DNA sequence that directs RNA polymerase to bind to DNA and initiate RNA synthesis. A strong promoter is one which causes mRNAs to be initiated at high frequency.

DNA encoding TPKS may also be cloned into an expression vector for expression in a host cell. Host cells may be prokaryotic or eukaryotic, including but not limited to bacteria, yeast, mammalian and insect cells and cell lines.

The expression vector may be introduced into host cells via any one of a number of techniques including but not limited to transformation, transfection, protoplast fusion, and electroporation. The expression vector-containing cells are clonally propagated and individually analyzed to determine whether they contain the TPKS gene or produce TPKS protein. Identification of TPKS expressing host cell clones may be done by several means, including but not limited to immunological reactivity with anti-TPKS antibodies, and the presence of host cell-associated TPKS activity.

Expression of TPKS DNA may also be performed using in vitro produced synthetic mRNA. Synthetic mRNA can be efficiently translated in various cell-free systems, including but not limited to wheat germ extracts and reticulocyte extracts, as well as efficiently translated in cell based systems, including but not limited to microinjection into frog oocytes, with micro-injection into frog oocytes being preferred.

PCR is the polymerase chain reaction, which is a technique for copying the complementary strands of a target DNA molecule simultaneously for a series of cycles until the desired amount is obtained.

Plasmids are generally designated by a low case p preceded or followed by capital letters and/or numbers. The starting plasmids used in this invention are commercially available, are publicly available on an unrestricted basis, or can be constructed from such available plasmids by conventional procedures. In addition other equivalent plasmids or constructs will be readily apparent to one skilled in the art.

Transformed host cells are cells which have been transformed or transfected with TPKS vectors constructed using recombinant DNA techniques. Expressed TPKS may be deposited in the cell membrane of the host cell or may be intracellular or may be secreted.

It is also well known, that there is a substantial amount of redundancy in the various codons which code for specific amino acids. Therefore, this invention is also directed to those DNA sequences which contain alternative codons which code for the eventual translation of the identical amino acid. For purposes of this specification, a sequence bearing one or more replaced codons will be defined as a degenerate variation. Also included within the scope of this invention are mutations either in the DNA sequence or the translated protein which do not substantially alter the ultimate physical properties of the expressed protein. For example, substitution of valine for leucine, arginine for lysine, or asparagine for glutamine may not cause a change in functionality of the polypeptide.

It is also well known that DNA sequences coding for a peptide may be altered so as to code for a peptide having properties that are different than those of the naturally-occurring peptide. Methods of altering the DNA sequences include, but are not limited to site directed mutagenesis. Examples of altered properties include but are not limited to changes in the affinity of an enzyme for a substrate. Alteration of the amino acid sequence may lead to altered properties that in turn result in the production of modified structures; for example, the elimination of one of the reductase activities may result in the biosynthesis of a less-reduced compound.

The full-length TPKS-encoding DNA in plasmid pLOA was designated pTPKS100. A sample of pTPKS-100 in E. coli strain JM109, was deposited under the terms of the Budapest Treaty, on Sep. 15, 1993 in the permanent culture collection of the American Type Culture Collection, at 12301 Parklawn Drive, Rockville, Md., 20852, and has been assigned the Accession number ATCC 69416.

The following examples illustrate the present invention without, however, limiting the same thereto.

EXAMPLE 1

Culture Conditions

Three strains of Aspergillus terreus were used. The two lovastatin-producing strains included A. terreus ATCC 20542. A lovastatin nonproducing strain was also used. A lovastatin-nonproducing strain or a lovastatin-overproducing strain of A. terreus may be derived from lovastatin-producing strains of A. terreus that are publicly available; an example of a publicly-available strain is A. terreus MF4833, which is deposited with the American Type Culture Collection under Accession No. 20542. One skilled in the art would appreciate that a variety of techniques such as mutagenesis techniques, including but not limited to ultraviolet irradiation, treatment with ethylmethanesulfonate (EMS), exposure to nitrous acid, nitrosoguanidine and psoralen-crosslinking, could be used to generate a strain that does not produce or which overproduces lovastatin. The extent of the mutagenesis may be determined in a variety of ways including auxotrophy, i.e., the requirement of the mutated strain for a specific growth substance beyond the minimum required for normal metabolism and reproduction of the parent strain as well as measurement of production of lovastatin by individual cultures. An alternative monitoring system involves the use of an intercalating dye such as acriflavine, which prevents any growth of the parent (lovastatin-producing) strain when plated at 10,000 spores per plate but, following mutagenesis, allows growth of about 3-5 colonies per plate. Alternatively, the extent of mutagenesis may be monitored by visual observation of colonies having morphologies or colors that differ from the unmutagenized parent strain. Mutant strains are reisolated and pooled and subjected to further mutagenesis so that, by repetition of these procedures, mutated strains of A. terreus that do not produce or which overproduce lovastatin may be obtained.

Monascus ruber ATCC 20657 and Penicillium citrinum ATCC 20606 were used in hybridization studies.

The strains were maintained on YME+TE medium. The recipe for YME+TE medium is as follows:

0.4% Yeast Extract (w/v);

1.0% Malt Extract (w/v);

0.4% Glucose (w/v);

0.5% Trace Element (TE; v/v); and

2.0% agar (w/v) in 1 liter of water, pH 7.2.

The recipe for Trace Elements (TE) is as follows:

0.1% FeSO₄ -7H₂ O (w/v);

0.1% MnSO₄ -H₂ O (w/v);

0.0025% CuCl₂.2H₂ O (w/v);

0.0132% CaCl₂.2H₂ O(w/v);

0.0056% H₃ BO₃ (w/v);

0.0019% (NH₄)₆ Mo₇ O₂₄.4H₂ O (w/v); and

0.02% ZnSO₄.7H₂ O (w/v) in 1 liter of water.

EXAMPLE 2

Fermentation Conditions

For the generation of spore stocks, single colonies were generated by growing on YME+TE plates for 8 days at 28° C. and 65% relative humidity. Single colonies were removed, and streaked on YME+TE slants. The slants were incubated for 8 days at 28° C. in 65% humidity. Spores were harvested by addition of 2 ml of Spore Suspension Solution (SSS). SSS contains 10% Glycerol (v/v) and 5% Lactose (w/v) in water. Spores were scraped into the SSS with a sterile inoculation loop and counted. The suspension was stored at -20° C.

A two-stage fermentation from spore suspensions was used for the production of lovastatin. A seed culture was started by inoculating 1×10⁸ spores into 2 ml/15 ml culture tube of HLC medium.

The recipe for HLC medium is as follows:

1.5% KH2PO₄ (w/v);

2.0% Cerelose (w/v);

0.1% Ardamine pH (Champlain Industries) (w/v);

1.5% Pharmamedia (Traders Protein) (w/v);

0.2% Lactic acid (v/v); and

0.4% ammonium citrate (w/v) in 1 liter of water.

The pH of HLC medium was adjusted to pH 7.2 before sterilization.

Cultures were shaken at a 30 degree angle at 28° C. for approximately 28 hours on a rotary shaker with a 70 mm diameter amplitude at 220 rpm. Two ml of seed culture was used to inoculate 25 ml of GP-9 medium in a 250 ml flask.

The recipe for GP-9 medium is as follows:

0.9% Ammonium Citrate (w/v);

0.12% Ardamine pH (w/v);

1.2% Cerelose (w/v);

4.0% Pharmamedia (w/v);

24.5% Lactose (w/v); and

0.2% P 2000 (v/v) in water at pH 7.2.

Incubation was continued as described for seed cultures without the 30 degree angle. Lovastatin production was monitored after 12 days of fermentation.

A one stage fermentation of A. terreus cultures in CM media was used to generate vegetative mycelia for transformations or DNA preparations. Fermentations were started by inoculating 1×10⁸ conidiospores into 50 ml of CM medium in a 250 ml flask and incubated as described.

The recipe for Complete Medium (CM) is as follows:

50 ml of Clutterbuck's salts;

2.0 ml Vogel's Trace elements;

0.5% Tryptone (w/v);

0.5% Yeast extract (w/v); and

1.0% Glucose (w/v) in one liter of water.

The recipe for Clutterbuck's salts is as follows:

12.0% Na₂ NO₃ (w/v);

1.02% KCl (w/v);

1.04% MgSO₄.7H₂ O (w/v); and

3.04% KH₂ PO₄ (w/v).

The recipe for Vogel's trace elements is as follows:

0.004% ZnCl₂ (w/v);

0.02% FeCl₃ (w/v);

0.001% CuCl₂ (w/v);

0.001% MnCl₂.4H₂ O;

0.001% NaB₄ O₇.10H₂ O (w/v); and

0.001% (NH₄)₆ MO₇ O₂₄.7H₂ O (w/v).

EXAMPLE 3

Construction of Vector, pLO9

pLO9 is a 5.6 kb vector constructed with features useful for both cosmid library construction and fungal transformations. For dominant selection in Aspergillus terreus, pLO9 contains a Streptoalloteichus hindustanus phleomycin resistance gene driven by an A. niger β-tubulin promoter and terminated by a Saccharomyces cerevisiae terminator sequence. For selection in Escherichia coli, the vector contains the ampicillin resistance gene and for lambda packaging, the vector contains a lambda cos site. The construction of pLO9 is described below.

The phleomycin resistance marker originated from S. hindustanus and the termination sequence is from the CYC1 gene in S. cerevisiae. Both sequences were isolated on one DNA fragment from pUT713 (CAYLA, Toulouse Cedex, France) by digesting pUT713 with the restriction enzymes BamH1 and BgIII. The isolated fragment was cloned into BamH1-digested pUC18 to produce vector pLO1. The genomic copy of the β-tubulin gene from A. niger ATCC 1015, was cloned as a 4.3 kb EcoR1 fragment in pUC8 to create p35-C-14. Several modifications were made to the genomic sequence. An EcoRI site was introduced at the initiator ATG by in vitro mutagenesis. The HindIII site in the promoter was removed by digestion with exonuclease, filling in with Klenow, and religation. Finally, an upstream EcoRI site was changed to a PstI site by digestion with EcoRI, filling in with Klenow and addition of a PstI linker by religation with ligase. The β-tubulin promoter was then subcloned as a PstI to EcoRI fragment in pUC8 to create pC15-1. An Xbal site was introduced at the initiator ATG by digestion with EcoRI, filling in with Klenow, addition of a Xbal linker and religation. The resulting vector was named pTL-113.

The β-tubulin promoter was cloned upstream of the phleomycin gene by cutting pTL113 with PstI and Xbal and cloning the isolated promoter fragment into the PstI and Xbal sites of pLO1 to produce pLO3. The BgIII site was removed with a fill in reaction followed by blunt-end ligation to produce vector pCS12. The PstI to HindIII fragment containing the beta tubulin promoter, phleomycin resistance gene, and the terminator sequence were cloned into a pUC8 vector to generate pLO6. The XbaI site at the ATG was removed by a fill-in reaction and ligation to give pLO7. The PstI to HindIII was moved as a fragment into a pUC18 backbone in which the XmaI site had been filled and replaced with a BgIII linker. The resulting vector was named pLO8. A PstI fragment containing the lambda cos site from pJL21 was inserted into the vector to generate pLO9.

EXAMPLE 4

Isolation of Genomic DNA

Vegetative mycelia were generated in CM media for 48 hr at 220 rpm at 28° C. Mycelia were collected by filtration through cheesecloth and frozen in liquid nitrogen for lyophilization overnight. Lyophilized mycelia were ground with sand using a mortar and pestle and suspended in 5 ml of Breaking Buffer (100 mM NaCl; 50 mM EDTA; 10 mM Tris, pH 8.0; 1% SDS; 50 ug/ml pancreatic RNase; 50 ug/ml Proteinase K). The mix was transferred to a 125 ml flask and an equal volume of Tris-saturated phenol/chloroform (50:50) was added. The flask was shaken for 1 hour at 37° C. and 200 rpm. The aqueous layer was removed after centrifugation at 10,000 rpm for 10 minutes. The aqueous layer was extracted twice more with phenol/chloroform and was then extracted once with chloroform. DNA was precipitated from the aqueous layer by addition of 0.1 volume 3M NaCl and 2.5 volumes of ethanol and then freezing at -70° C. for 10 minutes. The precipitated DNA was collected by centrifugation at 10,000 rpm for 15 minutes. The pelleted DNA was dried and resuspended in a solution of 10 mM Tris-HCl, 1 mM EDTA, pH 7.5. DNA concentrations were determined by measuring absorbance at wavelength 260 nM.

EXAMPLE 5

Construction of A. terreus Libraries

A. Preparation of Genomic Fragments

A. terreus genomic DNA was isolated as described. Large random DNA fragments for insertion into the vectors were isolated by partially digesting 10 μg of DNA with the restriction enzyme Sau3A. The digested DNA was electrophoresed on a 1.0% Agarose gel. For the genomic library, an area containing 9-23 kb sized fragments was cut from the gel. For the cosmid library, another segment of the gel containing 30-60 kb sized fragments was excised. The large chromosomal DNA fragments contained in the gel slices were isolated by electroelution. The DNA was concentrated by addition of 0.1 volumes of 3M sodium acetate and 2.5 volumes of ethanol, freezing at -70° C. for 15 minutes, and centrifugation at 10,000 rpm for 10 minutes to precipitate the DNA.

B. Construction of the A. terreus Cosmid Library

The pLO9 cosmid DNA was used to supply the two arms and cos sites required for lambda packaging. Two fragments were isolated from pLO9 for the packaging reaction.

Fragment one was isolated by digesting pLO9 with Xba1, phosphatasing with HK phosphatase (Epicenter Technologies), digesting with BgII, electroeluting on a 1% Agarose gel, concentrating by the addition of 0.1 volumes of 3M sodium acetate and 2.5 volumes of ethanol, freezing at -70° C. for 15 minutes and centrifuging at 10,000 rpm for 10 minutes to precipitate the DNA.

Fragment two was isolated by digesting pLO9 with SmaI, phosphatasing with HK phosphatase and then digesting with BgIII. Fragment two was isolated with the procedure described for fragment one. Fragment one, fragment two and isolated A. terreus insert DNA were ligated in a 1:1:2 ratio at a concentration of 0.5 μg of each DNA.

C. Packaging into Lambda Phage and Plating

Packaging into lambda phage was accomplished by mixing the ligation mixture with 10 μl of extract A from E. coli strain BHB2688 (Amersham) and 15 μl of extract B from E. coli strain BHB2690 (Amersham). The packaging mix was incubated at 22° C. for 120 minutes. A volume of 500 μl of SM (0.58% NaCl(w/v); 0.20% MgSO₄ (w/v); 0.05M Tris pH 7.5; 0.01% Gelatin(w/v)) and 10 μl of chloroform was then added to the packaging mix.

E. coli strain DH5 was prepared for transfection by growing cells to an optical density of 1.0 at wavelength 600 nm in LB+maltose medium. LB+maltose medium consists of 1.0% Bacto-tryptone (w/v); 0.5% Bacto-yeast extract (w/v); 1.0% NaCl (w/v); pH 7.5; 0.2% Maltose (v/v) is added after autoclaving.

The cells were centrifuged at 4,000 rpm for 10 minutes and resuspended in 10 mM MgSO₄. Fifty microliters of the packaging mix was added to 200 μl of the resuspended DH5 cells and incubated for 30 minutes at 37° C. A 500 μl of aliquot of LB medium was added and the mix was incubated for 30 minutes at 37° C. The cell mix was spread on LB agar plates containing 100 μg/ml ampicillin (Sigma) and incubated at 37° C. A total of 10,000 colonies were generated with this library.

D. Construction of the A. terreus Genomic Library

The lambda replacement vector, EMBL3 (Promega), was used for the construction of the genomic library. The vector was purchased as predigested arms ready for ligation with the genomic inserts. The two arms were ligated to the 9-23 kb genomic inserts at a ratio of 1:1:2, packaged into lambda phage, and plated for hybridization with selected probes as described above.

EXAMPLE 6

A. Isolation of Cosmid DNA from E. coli

The A. terreus cosmid library in E. coli was grown on 25 cm×25 cm plates containing 200 ml LB agar supplemented with 100 μg/ml ampicillin added. Nearly confluent colonies were scraped from plates in 10 ml of cold TS solution (50 mM Tris, pH 8.0 and 10% Sucrose(w/v)). A 2.0 ml aliquot of 10 mg/ml lysozyme made in 0.25M Tris, pH 8.0 was added; then 8 ml of 0.25M ethylenediamine tetraacetic acid (EDTA) was added. The mix was inverted several times and incubated on ice for 10 minutes. A 4 ml aliquot of a 10% SDS solution was added slowly while mixing gently with a glass rod. Next, 6.0 ml of 5M NaCl was added slowly while mixing with a glass rod. The cell lysate was incubated on ice for 1 hour and then centrifuged. The supernatant was saved and then extracted twice with an equal volume of Tris-saturated Phenol/Chloroform (50:50). DNA was precipitated by adding 2 volumes of ethanol, freezing at -70° C. for 15 minutes and then centrifuging at 3,000 rpm for 15 minutes. The precipitated cosmid DNA was dried and resuspended in 9 ml of Tris-EDTA.

Cosmid DNA was prepared for cesium chloride density gradient purification by dissolving 10 gm of CsCl2 in the DNA suspension and then adding 250 μl of 10 mg/ml ethidium bromide. Cosmid DNA was banded with a 20 hour centrifugation in a Ti865.1 Sorvall rotor at 55,000 rpm. The DNA bands representing cosmid DNA were recovered from the gradient, and ethidium bromide was removed by extraction with water-saturated butanol. Cosmid DNA was precipitated by adding 3 volumes of water and 10 volumes of ethanol, incubating on ice for 30 minutes and then centrifuging. The DNA was resuspended in Tris-EDTA and reprecipitated by the addition of 0.1 volume of 3M sodium acetate and 2.5 volumes of ethanol. DNA was frozen at -70° C. for 10 minutes, centrifuged, and resuspended in Tris-EDTA.

The DNA preparation was electrophoresed through a 0.5% Low Melting Temperature Agarose (BioRad) gel to eliminate contamination by pLO9 DNA. The band containing cosmid DNA with inserts was cut from the gel and heated to 65° C. with 2 volumes of Tris-EDTA. The melted agarose was extracted 3 times with Tris-saturated phenol and then once with chloroform. Cosmid library DNA was precipitated by addition of 0.1 volumes of 3M sodium acetate and 2.5 volumes of ethanol, freezing at -70° C. for 15 minutes, and centrifuging at 10,000 rpm for 15 minutes. The DNA was dried and resuspended in Tris-EDTA. The concentration of DNA was determined by measuring the optical density at 260 nm.

EXAMPLE 7

Transformation of A. terreus

Cultures were grown by inoculating 1×10⁸ conidiospores into 50 ml of CM media in a 250 ml Erlenmeyer flask. Cultures were grown for between 24 and 30 hr at 200 rpm and 28° C. Mycelia were harvested by gravity filtration through Miracloth. Mycelia (4 g) were transferred to a 500 ml Erlenmeyer flask containing 100 ml KMP. KMP consists of 700 mM KCl, 800 mM Mannitol, and 20 mM KH₂ PO₄ pH 6.3. Lysing Enzymes from Trichoderma harzianum (100 mg; Sigma) was added. Flasks were shaken at 100 rpm for 18 hours at 28° C.

Spheroplasts were harvested by gravity filtration through Miracloth. The filtrate was collected in 50 ml conical centrifuge tubes, concentrated by centrifugation and washed by resuspending the spheroplasted cells in 15 ml of KCM solution. KCM consists of 700 mM KCl; 10 mM MOPS adjusted to pH 5.8. The washing was repeated twice. Washed spheroplasts were resuspended at a concentration of 5×10⁷ /ml in KCMC. KCMC consists of 5% 1M CaCl₂ and 95% KCM.

For each transformation, a sample of 5 μg of DNA was brought to a volume of 20 μl in Tris-EDTA; then 5 units of heparin in 6.5 μl of KCMC was added. Next, 200 μl aliquot of the spheroplast suspension was added to the DNA-containing solution. Finally, 50 μl of aliquot of a solution containing 5% 1M CaCl₂ and 95% PCMC (40% PEG 8,000; 10 mM MOPS, pH 5.8; 0.05M CaCl₂) was added. The mixture was incubated on ice for 30 minutes.

An aliquot (600 μl) of the KCMC solution was added to a 45° C. equilibrated solution of MA. MA consists of 5% Clutterbuck's salts(v/v); 0.5% Tryptone (w/v); 0.5% Yeast Extract (w/v); 1.0% Glucose(w/v); 23.4% Mannitol(w/v) and 3% Agar. This suspension was divided among 5 preweighed petri dishes and incubated at 28° C. for 4 hours. The weight of agar in each plate was determined by a second weight and an equal amount of Overlay (OL) consisting of: 1% Peptone (w/v); 1% Agar (w/v); with between 100 μg/ml and 150 μg/ml (strain ATCC 20542) of phleomycin was added to each petri dish. Petri dishes were incubated at 28° C. and 65% humidity for 7-10 days before transformed colonies were picked.

EXAMPLE 8

Rescue of Cosmid DNA from A. terreus

The transforming cosmid DNA was rescued from an A. terreus transformants by isolating chromosomal DNA and packaging into lambda phage particles. Isolation of genomic DNA and packaging into lambda phage were performed as described above.

EXAMPLE 9

Detection of Lovastatin

Fermentation extracts were prepared by adding two volumes of reagent alcohol to the fermentation flasks and shaking the flasks were shaken for 15 minutes at 220 rpm and 28° C. The contents were allowed to settle for 15 minutes and 1 ml of the liquid was removed. The sample was diluted 1/20 in methanol, filtered and then analyzed by HPLC. Lovastatin was detected by a Waters HPLC using a 8 mm×10 cm C18 4 um Waters Novapak column. Mobile phases were A: Acetonitrile with 0.02% Trifluoroacetic acid and B: Distilled water with 0.02% Trifluoroacetic acid. Gradients were run at a flow rate of 1.5 ml/min. Initial conditions were 35% A and 65% B and were held for 1 minute after sample injection. A gradient was formed to 65% A and 35% B over 3 minutes and held for 3.6 minutes. Lovastatin ammonium salt was detected at 239 nm.

EXAMPLE 10

Southern Analysis of DNA

Southern analysis was performed by electrophoresing 5 μg of digested DNA on a 1.0% agarose gel in TAE buffer (0.04M Tris and 0.002M EDTA). DNA in the gel was denatured by soaking the gel in Solution A (1.5M NaCl and 0.5M NaOH) for 30 minutes. The gel was then neutralized in Solution B (1.0M Tris and 1.5M NaCl) for 30 minutes. DNA was transferred to nitrocellulose or nylon membranes by blotting overnight with a 10×SCC solution. SSC consists of 8.75% NaCl (w/v) and 4.4% sodium citrate (w/v), pH 7.0. DNA was baked onto the nitrocellulose at 80° C. under vacuum for 30 minutes.

Standard hybridization conditions were as described in Sambrook, J. et al., (Molecular Cloning, 1989 (ed. Chris Nolan) Cold Spring Harbor Press). Membranes were prepared for hybridization by incubating at 42° C. in hybridization buffer consisting of: 6×SSC, 5×Denhardt's reagent, 0.5% SDS, 100 μg/ml denatured and fragmented salmon sperm DNA, and 40% formamide. After incubating for two hours, the denatured labeled probe was added and further incubated overnight at 42° C. Unless otherwise stated, the filters were washed twice in 6×SSC and 0.1% SDS at room temperature for 15 minutes followed by two 30 minute washes at 42° C. in 0.1×SSC and 0.5% SDS. Filters were exposed to X-ray film for visualization of the signal.

EXAMPLE 11

A. Isolation of Triol Polyketide Synthase from A. terreus

Mycelia of A. terreus were grown in GP-9 medium. After 48 hours the mycelia were collected by vacuum filtration, washed with cold water, frozen in liquid nitrogen and lyophilized. All subsequent steps of the purification were performed on ice or at 3° C. unless otherwise noted.

Lyophilized mycelia (6 g) were homogenized by grinding with 20 gm glass beads (0.2 mm) in a mortar with pestle in 135 ml homogenization buffer consisting of: 20 mM Tris, pH 8; 10% glycerol; 5 mM EDTA; 50 mM NaCl; 5 mM ascorbic acid; 3.8 μg/ml leupeptin; 17.7 μg/ml chymostatin; 2.0 μg/ml pepstatin, 42 μg/ml turkey trypsin inhibitor, 0.2 mM PMSF; and 2.2% (dry wt/v) hydrated polyvinyl polypyrrolidone. The homogenate was centrifuged at 7,650×g for 10 minutes; and the supernatant applied to an SH-affinity column (Affi-gel 501 organomercurial agarose; Bio-Rad; 1.5×8.0 cm) equilibrated in Buffer A. Buffer A consists of 20 mM Tris, pH 8; 50 mM NaCl; 5 mM EDTA; 5 mM ascorbic acid; at 30 ml/hr. The column was washed with 25 ml Buffer A followed by 75 ml Buffer A containing 0.5M NaCl. After reequilibrating the column with 50 ml Buffer A, bound proteins were eluted with 40 ml Buffer A supplemented with 100 mM dithiothreotol (DTT). The eluted protein fraction was made 4.2 μg/ml leupeptin; 2 μg/ml pepstatin; 18 μg/ml chymostatin; 0.2 mM PMSF and then was pelleted by ultracentrifugation at 180,000×g for 16 hr. The supernatant was discarded, and the pellet was rinsed with a buffer consisting of 20 mM Tris, pH 8; 5 mM ascorbic acid; 1 mM DTT; 1 mM EDTA. The washed pellet was resuspended in 2 ml of buffer consisting of 40 mM Tris, pH 6.8; 20 mM DTT; 2% SDS, then heated to 90° C. for 10 minutes and put on ice.

A 250 μl aliquot of the resuspended pellet was combined with an equal volume of sample buffer (125 mM Tris, pH 6.8; 20% glycerol; 0.005%(w/v) bromphenol blue; 4%(w/v) SDS; 1.5M beta mercaptoethanol) and heated to 95° C. for 10 minutes. The sample was electrophoresed on a preparative 1.5 mm, 4% acrylamide SDS precast gel (Novex) at 145V for 2 hr using Laemmeli electrode buffer system (25 mM Tris; 192 mM glycine; 0.1% SDS). When a prestained 200 kD reference standard was 1.4 cm from the bottom of the gel, the electrophoresis was terminated.

Proteins were visualized as follow. The gel was rinsed for 5 seconds in distilled H₂ O, then rinsed for 10 minutes in 0.2M imidazole with shaking and was then transferred to a solution of 0.3M zinc acetate for 5 minutes with shaking. The gel was then rinsed in water. The TPKS, which ran with an apparent molecular weight of 235 kD, was localized to a relative mobility position of 0.53 (relative to the bottom of the gel). The TPKS protein was the protein of greatest abundance on the gel; no significant protein banding was seen with lower R_(f). The apparent 235 kD protein band was excised from the gel and was then destained in 0.25M Tris and 0.25M EDTA pH 9.5 for approximately 5 minutes.

The destained gel slice was crushed between two glass plates and placed in a 50 ml tube containing 5 ml of 20 mM Tris, 5 mM EDTA, 0.1% SDS, pH 8.0. The tube was shaken on a rotary shaker for 48 hours at 37° C. Gel fragments were removed by centrifugation, and the supernatant containing the eluted protein was concentrated to 100 μl with a Centricon 30 microconcentrator (Amicon).

B. Molecular Weight Determination

The gel-purified protein was resuspended in Laemmli load buffer, heated to 95° C. for 5 min. and then electrophoresed on a 4-15% gradient SDS polyacrylamide gel (BioRad Ready-Gel) in Laemmli electrode buffer. After staining, the molecular weight of the protein was determined by comparison to molecular weight standard proteins.

C. Antibody Production

The TPKS protein was prepared via preparative SDS-PAGE as described, except the protein was not electroeluted from the acrylamide gel matrix. Following destaining, the gel slice was crushed between two glass plates, and first forced through a 18 gauge syringe needle and then through a 25 gauge syringe needle. A 0.5 ml aliquot of the 25 gauge needle eluate was mixed with an equal volume of Freund's complete adjuvant and injected intradermally at five sites of a New Zealand white male rabbit. Boosts were done at 21 and 42 days using protein prepared as described, but mixed with 0.5 ml of Freund's incomplete adjuvant. Ten days after the final boost the rabbit was exsanguinated and the antiserum collected.

D. Affinity Purification of Antibody

Affinity purified antibody was prepared by immobilizing the TPKS protein to PVDF membrane by transfer from a preparative SDS polyacrylamide gel. The TPKS was visualized and that area of the membrane cut out. After blocking in 5%(w/v) non-fat dry milk in TTBS for 1 hour, the membrane was washed 3×5 minutes in TTBS. A 2 ml aliquot of antisera was diluted 1:1 with TTBS supplemented with 1% (w/v) non-fat dry milk and incubated with the immobilized antigen for 5 hours. The membrane was then washed 4× (10 minutes per wash) with TTBS, and the bound antibody was eluted with 2 ml of 0.1M glycine, pH 2.8. The eluted antibody was neutralized with 50 μl of 1.0M Tris, pH 9.5 and concentrated twenty-fold.

E. Western Blot Analysis

Purified TPKS protein and partially purified protein preparations of organomercurial eluates were resolved by 4% acrylamide SDS-PAGE (NOVEX, precast 1.0 mm thick gels) and then transferred to nitrocellulose in Towbin transfer buffer (25 mM Tris; 192 mM glycine, pH 8.3; 20% methanol; 0.05% SDS) at 240 mA for 2 hr. All subsequent steps were done at room temperature with shaking.

The nitrocellulose blot was rinsed for 1 minute in TBS (50 mM Tris, pH 7.5; 0.5M NaCl) and then blocked for 2 hours in TBS supplemented with 0.05% Tween 20 (TTBS) and 5% (w/v) non-fat dry milk. The blot was incubated with the primary antibody (a 1:1000 dilution of rabbit antisera in TTBS containing 1% (w/v) non-fat dry milk) for 16 hr. The blot was washed in TTBS 3 times for 5 min. The blot was incubated with the second antibody (goat anti-rabbit alkaline phosphatase conjugate diluted 1:1000) for 2 hr in TTBS supplemented 1% (w/v) non-fat dry milk. After washing 4 times (10 minutes per wash) in TTBS, color development was achieved with 5-bromo-4-chloro-3-indolyl phosphate (115 μg/ml) and nitroblue tetrazolium (330 μg/ml) in 66 mM Tris, pH 9.5; 0.1M NaCl; 5 mM MgCl₂.

EXAMPLE 12

Isolation of Aspergillus RNA

A. Isolation of Total RNA

A. terreus was grown for 48 hours in 25 ml of GP-9 fermentation medium at 28° C. and 220 rpm on a rotary shaker. Mycelia were collected by vacuum filtration through Miracloth and cheesecloth and washed with approximately 100 ml distilled water. The mycelia were scraped from the filter into a plastic beaker and frozen with liquid nitrogen. Frozen mycelia were stored at -80° C. until needed.

Frozen mycelia were weighed and placed in a mortar chilled with liquid nitrogen. Approximately 2 g of 0.2 mm glass beads were added, and the mix was ground to a fine powder with a pestle. Liquid nitrogen was added as needed to keep the mycelia frozen at all times. Ground mycelia were added to a flask containing approximately 2.5 ml/g Breaking Buffer (50 mM Tris pH 7.4; 150 mM NaCl; 5 mM EDTA; 5% SDS(w/v)) and an equal volume of Tris-saturated phenol:chloroform:isoamyl alcohol (50:50:1), and vanadyl ribonucleoside complex (BRL) to a final concentration of approximately 2 mM. The mixture incubated on a rotary shaker at 37° C. for 20 minutes and was then centrifuged at 12000×g for 10 min at 4° C. The aqueous layer was removed and extracted with an equal volume of Tris-saturated phenol:chloroform:isoamyl alcohol (50:50:1). Second and third extractions were done with 1M Tris-saturated phenol:chloroform (50:50) and chloroform, respectively. The final aqueous layer was mixed with an equal volume of 6M LiCl and left at -20° C. for at least 4 hours. The precipitate was pelleted at 12,000×g for 20 minutes at 4° C. and resuspended in 0.6 ml water treated with 0.1% diethyl pyrocarbonate (DEPC). The total RNA was reprecipitated with 0.1 volume of sodium acetate and 2.5 volumes ethanol. The final pellet was dissolved in 0.3 ml water treated with 0.1% DEPC.

B. Isolation of Polyadenylated RNA

Polyadenylated RNA was isolated by heating approximately 500 μg of total RNA in 0.2 to 1.0 ml water to 65° C. for 5 minutes, cooling on ice, and adding 10× sample buffer consisting of: 10 mM Tris pH 7.5; 1 mM EDTA; 5M NaCl in 0.1% DEPC-treated water to a final concentration of 1×. The treated sample was applied to a column of oligod(T) cellulose prepared according to the manufacturer's instructions (Poly(A)Quik™ mRNA purification kit--Stratagene). The column was washed twice with High Salt Buffer (10 mM Tris pH 7.5; 1 mM EDTA; 0.5M NaCl) and three times with Low Salt Buffer (10 mM Tris pH 7.5; 1 mM EDTA and 0.1M NaCl). PolyA mRNA was then eluted from the column with four 200 μl aliquots of Elution Buffer (10 mM Tris pH 7.5 and 1 mM EDTA) which had been heated to 65° C. RNA concentration was determined spectrophotometrically using absorbance at 260 nm.

EXAMPLE 13

Construction of Lambda gt-11 cDNA Library

A cDNA library was constructed using 4 to 5 μg of polyadenylated RNA that had been purified twice over an oligo(dT) column. The reagents for construction of cDNA, addition of adapters and ligation of lambda gt-11 arms except ³² P!dCTP were provided in the Superscript™ Choice System (BRL) and were used according to the manufacturer's instructions.

First strand synthesis was primed using either 0.05 μg random hexamers plus 0.5 μg oligo(dT)12-18 or 1 μg oligo(dT)12-18 alone. The reaction was carried out in a final volume of 20 μl (final composition: 50 mM Tris, pH 8.3; 75 mM KCl; 3 mM MgCl₂ ; 10 mM DTT; 500 uM each dATP, dCTP, dGTP, dTTP; primers; mRNA; 10 μCi ³² P!dCTP; 200 U Superscript™ reverse transcriptase/μg mRNA). The reaction mixture was incubated for 1 hr at 37° C. and then placed on ice.

Second strand synthesis was carried out in a final volume of 150 μl using 18 μl of the first strand reaction. The final composition of the reaction was: 25 mM Tris pH 7.5; .100 mM KCl; 5 mM MgCl₂ ; 10 mM (NH₄)₂ SO₄ ; 0.15 mM B-NAD+; 250 μM each dATP, dCTP, dGTP, dTTP; 1.2 mM DTT; 65 U/ml DNA Ligase; 250 U/ml DNA polymerase I; and 13 U/ml RNase H. This reaction mixture was incubated at 16° C. for 2 hr; then 10 U of T4 DNA polymerase was added, and the incubation was continued at 16° C. for an additional 5 minutes. The reaction was put on ice and stopped by adding 10 μl of 0.5M EDTA. The mix was extracted with 150 μl of Tris-saturated phenol:chloroform:isoamyl alcohol (25:24:1). The aqueous layer was removed, and cDNA was precipitated with 0.5 volume 7.5M ammonium acetate and 3.5 volumes ethanol. The cDNA pellet was washed with 70% ethanol. EcoRI (Not1) adapters were ligated to the cDNA in a reaction mix comprised of 66 mM Tris, pH 7.6; 10 mM MgCl₂ ; 1 mM ATP; 14 mM DTT; 200 μg/ml EcoRI (Not1) adapters; 100 U/ml T4 DNA ligase. The reaction mixture was incubated for 16 hours at 16° C., then heated to 70° C. and placed on ice. The adapted cDNA was phosphorylated by adding 30 U of T4 polynucleotide kinase to the reaction mix and incubating for 30 minutes at 37° C. The kinase was inactivated by heating to 70° C. for 10 minutes. The completed reaction was diluted with 97 μl of TEN buffer (10 mM Tris, pH 7.5; 0.1 mM EDTA; 25 mM NaCl) and placed over a Sephacryl® DNA sizing column prepared according to the manufacturer's directions (BRL). The DNA was eluted with TEN buffer and fractions were collected. Cerenkov counts were obtained for each fraction and the amount of cDNA/fraction was calculated. The column fractions were pooled in order of elution until 50 ng cDNA was collected. The pool was then precipitated with 5 μl yeast tRNA, 0.5 volumes 7.5M ammonium acetate and 2 volumes ethanol (-20° C.). The resultant pellet was washed with 70% ethanol, dried and ligated to lambda gt-11 arms. The final composition of the ligation reaction was 50 mM Tris pH 7.6; 10 mM MgCl₂ ; 1 mM ATP; 5% PEG 8000(w/v); 1 mM DTT; 100 μg/ml lambda vector EcoRI arms; 10 μg/ml cDNA; and 200 U/ml T4 DNA ligase. This mixture was incubated for 3 hours at room temperature. The cDNA/lambda gt-11 ligation was packaged into infectious lambda phage particles as described above.

EXAMPLE 14

A. Antibody Screening of Lambda gt-11 Library

E. coli strain Y1090 was used as the host for lambda phage infections and was maintained on LB/ampicillin plates consisting of: 1% tryptone (w/v); 0.5% yeast extract (w/v); 0.5% NaCl (w/v); 1.5% agar (w/v); the pH was adjusted to 7.5 before autoclaving, and 100 μg/ml ampicillin added after autoclaving. Cultures were grown for phage infection by incubating a single colony overnight on a rotary shaker at 37° C. in 3 ml LB/maltose broth consisting of: 1% tryptone(w/v); 0.5% yeast extract(w/v); 0.5% NaCl(w/v) and 0.2% maltose(w/v).

B. Pretreatment of Antisera

Antisera were treated with an E. coli lysate prior to screening so as to decrease cross-reaction to E. coli protein. E. coli lysate was prepared from Y1090 cells grown overnight in LB broth at 37° C. on a rotary shaker at 220 rpm. Cells were pelleted by centrifugation at 10,000×g at 4° C. and resuspended in 3 ml Lysate Buffer (50 mM Tris pH 8.0 and 10 mM EDTA). Cells were frozen in a dry ice/ethanol bath and thawed at room temperature; the freeze/thaw process was repeated. The suspension was sonicated 5×10 seconds at output control 4 on a constant duty cycle using a Branson Sonifier 450. Cells were placed on ice for 10 seconds after each pulse. Protein concentration in the lysate was estimated using the Bradford Assay (Bio-Rad) according to the manufacturer's suggestion. Sonicated lysate was stored at -20° C. until needed. The antisera was diluted 10-fold with TBST plus 1% dried milk(w/v) and mixed with 1/20 volume E. coli lysate. This solution was incubated at room temperature on a rotary shaker for two hours.

C. Screening of Lambda Gt-11 Phage Plaques

Recombinant phage diluted to 6×10³ pfu in 100 μl of SM was added to 600 μl of an overnight culture of E. coli Y1090 and absorbed at 37° C. for 30 minutes. The cells were then added to 7.5 ml of a 47° C. solution of LB Top Agarose/MgSO₄ (0.1% tryptone(w/v); 0.5% yeast extract(w/v); 0.5% NaCl(w/v); 10 mM MgSO₄) and plated on a 140 mm LB agar plate. The plate was incubated at 42° C. for approximately 5 hours until tiny plaques were visible. The plate was then overlaid with a 137 mm nitrocellulose filter which had been saturated with a 10 mM solution of IPTG (isopropyl-B-D-thiogalactopyranoside) and air-dried. Incubation of the plate was continued overnight at 37° C. The filter was removed and washed 3 times for 15 minutes each. All washes were carried out at room temperature on a rotary shaker in TBST. The filters were blocked in TBST plus 5% w/v dried milk (Carnation instant non-fat dried milk) for 30 minutes at room temperature on a rotary shaker. Filters were washed 3×15 minutes and then incubated with a 1:1000 dilution of goat anti-rabbit IgG alkaline phosphatase conjugate (Bio-Rad) in TBST plus 1% dried milk(w/v) for 2 hours. The filters were washed 3×15 minutes and then developed in AP buffer (100 mM Tris pH 9.5; 100 mM NaCl; 5 mM MgCl₂) to which was added NBT (nitroblue tetrazolium) to a final concentration of 0.33 mg/ml and BCIP (5-bromo-4-chloro-3-indoyl phosphate) to a final concentration of 0.165 mg/ml for 2-5 minutes. The color reaction was stopped by washing the filters with water. Positive plaques were picked to 1 ml SM plus 10 μl chloroform and stored at 4° C. until needed.

Positive plaques were further purified until all the plaques on a filter were positive. Purification rounds were done on 100 mm LB/agar plates with phage titer adjusted to approximately 100 pfu/plate. Positive plaques were confirmed by screening with an affinity-purified antibody at a dilution of 1:100.

EXAMPLE 15

Preparation of Lambda DNA

Phage were adsorbed to 1.5 ml of an overnight culture of E. coli Y1090 at a multiplicity of infection of 0.01 for 30 minutes at 37° C. and then added to 300 ml LB media. The cells were incubated at 37° C. on a rotary shaker about 6 hours (until the cells lysed). One ml chloroform was added to complete the lysis. Cell debris was pelleted by centrifugation at 10,000×g for 10 minutes at 4° C. Lysate was stored at 4° C. until needed.

Lysate was treated with DNase I (final concentration 1 μg/ml) and RNase H (final concentration 5 μg/ml) at 37° C. for one hour. Phage were pelleted by centrifugation for 90 minutes at 27,000 rpm in a Sorvall AH-629 rotor; and the tubes were inverted to drain. Phage pellets were resuspended in 200 μl 0.05M Tris, pH 8 and were extracted with 200 μl Tris-saturated phenol by vigorous shaking for 20 minutes. The mixture was spun in a microcentrifuge, and the aqueous layer saved. The aqueous layer was extracted with phenol and then extracted twice with 200 μl chloroform. DNA was precipitated with 0.1 volume 3M sodium acetate and 6 volumes ethanol at room temperature. DNA was pelleted in a microcentrifuge, washed with 70% ethanol, dried and resuspended in 100 μl TE pH 8.0 (10 mM Tris; 1 mM EDTA).

EXAMPLE 16

Screening of EMBL3 Genomic Library

The EMBL3 genomic library was plated for screening with ³² P-labeled DNA probes. Approximately 10,000 plaques were plated and transferred to nitrocellulose for hybridizations. Filters were prehybridized for 2 hours and hybridized overnight in hybridization buffer in the presence of a DNA probe labeled with ³² P-dCTP (Oligolabeling Kit, Pharmacia). For the selection of EMBL-1, the DNA probe consisted of the EcoRI cDNA insert of lambda gt-11 2-9 which was identified using the antibody to the 235 kD protein. Filters were washed using the protocol employed for Southern hybridizations, and positive plaques were identified after an overnight exposure to film. DNA from positive EMBL-3 phage was prepared as described.

EXAMPLE 17

Sequencing Strategy and Analysis

A series of overlapping subclones from the genomic EMBL1 clone, which contained the triol PKS gene, were constructed in M13mp18 and M13mp19. Nested deletions of some of the clones were obtained using the Cyclone I Biosystem (International Biotechnologies, Inc., New Haven, Conn.). Single stranded DNA was purified by precipitation with 20% polyethylene glycol-2.5M NaCl followed by phenol extraction and ethanol precipitation. The nucleotide sequence of both strands of the DNA was determined using the USB Sequenase Version 2.0 DNA Sequencing Kit (United States Biochemicals, Cleveland, Ohio). The -40 sequencing primer from the kit or custom synthesized oligonucleotides were used to prime the reactions. Regions containing GC compressions were resequenced using dITP in place of dGTP. The sequencing reactions were separated on 6% polyacrylamide denaturing gels. The genomic M13 clones were resequenced using a 373A DNA Sequencer (Applied Biosystems, Inc.) for verification. Introns were identified by sequence analysis of cDNA. The RNA was prepared from a 16 hr culture grown in GP9 medium, and cDNA was synthesized using AMV reverse transcriptase. Custom synthesized oligonucleotides were used to amplify short overlapping stretches of the cDNA by PCR. The PCR conditions, reagents, and product purification were performed as described for PCR with genomic DNA in the PCR/Sequencing Kit PCR Amplification Module manual (Applied Biosystems, Inc., Foster City, Calif.). The PCR were performed using a Perkin Elmer GeneAmp PCR system 9600. The PCR products were sequenced as described in the Taq DyeDeoxy Terminator Cycle Sequencing Kit manual (Applied Biosystems, Inc.), and sequencing reactions were analyzed using the 373A DNA Sequencer. All sequence analyses and manipulations were performed using GeneWorks (IntelliGenetics, Inc., Mt. View, Calif.) on a Macintosh computer (Apple Computer, Inc., Cupertino, Calif.).

EXAMPLE 18

A. Construction of pTPKS100

The transformation vector pTPKS100 contains the polyketide synthase gene responsible for the synthesis of the nonaketide backbone of the triol structure, the phleomycin resistance gene for selection in A. terreus and the ampicillin resistance gene for selection in E. coli.

The vector was constructed from the pUT715 vector (Cayla, Toulouse Cedex, France) which contains the phleomycin resistance marker from S. hindustanus and the termination sequence from the Cyc1 gene in S. cerevisiae. The pUT715 vector was digested with BamHI and EcoRv. The β-tubulin gene promoter was inserted in front of the phleomycin marker gene as follows. The β-tubulin promoter was removed from pTL113 by digestion with EcoRI, filling with Klenow fragment, and releasing the fragment from the vector with a BgIII digest. The β-promoter was ligated into the pUT715 vector to form pCLS7. The β-tubulin promoter, phleomycin marker and Cyc1 terminator were removed from PCLS7 by digestion with Ndel and BgIII followed by filling in the sites, and ligating into the SmaI site of the Bluescript vector (Strategene). This vector was named pLOA.

The polyketide synthase gene was inserted into pLOA in a two step process. The promoter and 5'-end of the PKS gene was obtained from EMBL-1 as a KpnI to EcoRI fragment and ligated into pLOA which had been digested with KpnI and EcoRI. This vector was named TPKS A. The 3' end of the PKS gene was then added to the construction by digesting TPKS A with EcoRI and ligating in the 3' EcoRI gene fragment isolated from EMBL-1. The resulting vector was named pTPKS100.

Transformation of a lovastatin-nonproducing strain with pTPKS100 restored lovastatin production. Transformation of ATCC 20542 (a lovastatin-producing strain) increased lovastatin production relative to untransformed cells.

EXAMPLE 19

Transformation of A. terreus ATCC 20542

To determine whether increasing the copy number of the PKS gene in a lovastatin-producing strain would result in an increase in the amount of lovastatin produced, a set of experiments were designed and carried out using the A. terreus ATCC 20542. ATCC 20542 was transformed with pTPKS-100. Transformants were checked by PCR to confirm that they contained the phleomycin marker and were true transformants. Following single spore isolation, the confirmed transformants were fermented and lovastatin production was measured by HPLC. The highest producer of single isolates, strain 3-17-7#7, was 32% greater for the transformant than for the parent.

EXAMPLE 20

Characterization of the TPKS Protein Sequence

Splicing of the introns from the DNA sequence and translation of the 9114 nucleotide open reading frame results in a protein of 3038 amino acids with a molecular weight of 269,090 daltons. The final amino acid sequence of the TPKS protein is shown in FIGS. 2A-2C. The features discussed below are presented with their amino acid position noted in the following table.

    ______________________________________                                         TPKS PROTEIN FEATURES                                                          Description      Motif        Amino Acid                                       ______________________________________                                         Keto-acyl synthase                                                                              Cysteine     181                                              Acetyl/Malonyl Transferase                                                                      GXSXG        654-658                                          Dehydratase      HXXXGXXXXP   985-994                                          Methyl Transferase                                                                              GXGXG        1446-1450                                        Enoyl Reductase  SXGXXS       1932-1937                                        Keto Reductase   LXGXXG       2164-2169                                        Acyl Carrier Protein                                                                            Serine       2498                                             ______________________________________                                    

Inspection of the TPKS amino acid sequence for active site residues and motifs known to be associated with polyketide synthases and fatty acid synthase (FAS) activities resulted in the identification of candidates for expected functional sites. These sites were identified by carrying out searches for amino acid sequences and amino acid homologies using the Intelligenetics Gene Works program. A graphic view of the open reading frame of the protein and the overall placement of the TPKS peptide sequences obtained by partial sequence analysis of TPKS peptides and PKS activities established by alignments and is shown in the figures. Except for the presence of a methyl transferase, not present in FAS, the succession of activities on the TPKS protein is the same as that observed for the rat FAS protein. The alignments carried out on regions of the TPKS, the rat FAS, and the 6-methylsalicyclic acid synthase (6-MSAS) of Penicillium patulin in order to identify the best candidate for each of the activities are also presented in the figures.

EXAMPLE 21

Identification of the Keto Acyl Synthase Region

The most 5' site is the β-keto acyl synthase (KAS), also known as the condensing enzyme. This activity is centered around the active site cysteine to which the acyl chain is attached prior to the entry and condensation of the incoming acyl unit. The region shown in the Keto Acyl Synthase Alignment figure contains 30% homology when compared to both the rat FAS and 6-MSAS sequences. However, the TPKS KAS region is most closely related to the rat FAS sequence, exhibiting 49% homology over this region compared to 41% to 6-MSAS.

EXAMPLE 22

Identification of the Acetyl Malonyl Transferase

Proceeding towards the COOH terminus, the next functional site identified is the acetyl/malonyl transferase, which is responsible for accepting the incoming substrate for transfer to either the active thiol of the beta-keto synthase (if a priming acetyl unit) or to the active site thiol of the ACP-pantetheine-SH if a malonyl building block. The identification of the acetyl/malonyl transferase site was found by searching for the GXSXG motif found in many proteins with an active site serine (Wakil, S. J., 1989, Biochemistry, 28: 4523-4530). The conservation of this motif in the TPKS protein was observed beginning at amino acid 654, as shown in the figures.

EXAMPLE 23

Identification of the Dehydratase

The next site in common with the FAS protein is the dehydrates. The dehydratase motif consistently found not only in the rat FAS, but the 6-MSAS and the erythromycin SU4 as well consist of a "HXXXGXXXXP" sequence (Donadio, S. and Katz, L., 1992, Gene, 111, 51-60.). The homology outside of this signature sequence is very weak.

EXAMPLE 24

Identification of the Enoyl and Keto Reductase

The next two activities identified on the rat FAS protein are the enoyl reductase (ER) and keto reductase (KR). In general, the ER and KR are identified by searching for the GXGXXG/A motif which is proposed to represent the pyridine nucleotide binding site in many proteins (Wierenga, R. K. and Hol, W. G. J., 1983, Nature, 302, 842-844). An identical match to this motif has been identified in the rat FAS for both the KR and ER (Witkowski, V., et al., 1991, Eur. J. Biochem., 198, 571-579). Inspection of the TPKS protein identified three matches to the motif. The first begins at position 321 between the β-keto synthase and acetyl/malonyl transferase functions. However, this is not considered to be a good candidate for either of the reductase activities due to its 5' position in the protein and because it lies in a region which is highly homologous to rat FAS. The GXGXXG motif is seen again at position 1446-1451, however, this is considered to be part of the methyl transferase domain. The third time the motif occurs is at position 2438 which lies 60 amino acids 5' of the ACP active site serine. A similar GXGXXG motif is seen in the rat FAS at 125 amino acids prior to the ACP and in 6-MSAS 129 amino acids 5' of the ACP. Since candidates for the NAD(P) binding sites of the KR and ER were not observed in the TPKS protein, homology searches were performed between the regions of the rat FAS which contain these sites and similar regions of the TPKS protein.

As shown in the Enoyl Reductase Alignment, the region of the TPKS protein which lies between the dehydratase and the keto reductase and shows the best alignment to the rat FAS enoyl reductase does not bear a strong homology to the GXGXXG motif or to the region in general. A much stronger homology is evident between the ER domain of SU4 of Erythromycin AII and the rat FAS sequence. The Keto Reductase Alignment of the rat FAS and 6-MSAS keto reductase regions with the TPKS shows slightly higher homology, with 6 out of 30 amino acids surrounding the glycine-rich region conserved between all genes and 13 of 30 conserved between TPKS and either FAS or 6-MSAS.

The glycine-rich segment is part of an overall structural motif for pyridine-nucleotide domains in many proteins (Wierenga, ibid.; Scrutton, N. S., et al., 1990, Nature, 343, 38-43; Ma, Q., et al., 1992, 267, 22298-22304; Hanukoglu, I., and Gutfinger, T., 1989, Eur J. Biochem., 180, 479-484). This structural motif consists of a beta sheet-turn-alpha helix where the glycine rich region codes for the strong turn signal in the middle. In addition, downstream acidic or basic amino acids are positioned to bind to the phosphate (NADP) or hydroxyl group (NAD) on the 2' ribose position. This is depicted in a Chou Fasman analysis of the secondary structure of horse alcohol dehydrogenase as a model NADP binding protein. The analysis of the structural characteristics using the Chou Fasman algorithm indicate that this structural motif is conserved in the rat FAS ER and KR domains, (Witkowski, A., 1991, Eur. J. Biochem., 198, 571-579). The structural predictions of the amino acid sequence of the TPKS ER and KR, as well as the 6MSAS KR, show variations of the model. All predicted structures show a β sheet leading into a turn region, even when amino acid homologies are not strong. It has been suggested that deviations from the structural model may reflect differences in substrate specificity (Ma, Q., supra). It is possible that these structural variations are important in the programming of the PKS, resulting in different levels of reduction of the beta-keto group during successive cycles of the biosynthesis of the triol precursor. Consistent throughout the alignments are the presence of basic amino acids at position 20 to 23 amino acids from the "glycine rich" regions identified by the homology searches. The structural similarities and the presence of these basic amino acids suggest that these regions do indeed represent the keto and enoyl reductases of the TPKS protein.

EXAMPLE 25

Identification of the Acyl Carrier Protein

The last active site identified by alignment of the rat FAS with the TPKS is the acyl carrier protein (ACP) active site serine which binds the 4'-phosphopantetheine prosthetic group. While only 6 out of 30 amino acids surrounding the active site serine are conserved over TPKS, rat FAS and 6-MSAS, a higher degree of homology (13 of 30 amino acids) is observed between TPKS and either rat FAS or 6-MSAS.

EXAMPLE 26

Identification of the Methyl Transferase

One activity identified within the reading frame of the TPKS protein which is not present in rat FAS is the methyl transferase responsible for transfer of the methyl group from S-adenosylmethionine (SAM) to the polyketide chain at position 6. A comparison of both eucaryotic and procaryotic methyl transferases responsible for the methylation of RNA, DNA, and protein substrates has identified a sequence motif thought to be part of the SAM-binding domain (Ingrosso, D. et al., 1989, J. Biol. Chem., 264, 20131-20139; Wu, G. et al, 1992, J. Gen. Micro, 138, 2101-2112). The binding motif and its alignment with the proposed methyl transferase of the TPKS are shown in the figures.

The absence of a methyl group in compactin suggests that the methyl transferase domain may be absent or altered in the compactin PKS.

EXAMPLE 27

A. Transformation of Monascus ruber

Cultures of M. ruber strains M4681 AND M82121 are grown, spheroplasted and transformed essentially according to the procedures described above. Petri dishes are incubated at 28° C. and 65% humidity for 7-10 days before transformed colonies are picked.

B. Fermentation of Monascus

The transformed cultures are grown aerobically in a medium containing 7% glycerol, 3% glucose, 3% meat extract, 0.8% peptone, 0.2% NaNO₃, and 0.1% MgSO₄.7H₂ O at 25 degrees C. for 10 days (Kimura et al., 1990. "Biosyn. of Monacolins, Conversion of Monacolin J. To Monacolin K (Mevinolin)", J. of Antibiotics, Vol. XLIII No. 12, 1621-1622). M. ruber M82121 is grown aerobically at 25° C. for 11 days in a medium containing 11% glycerol, 1% glucose, 5% soy bean powder, 0.8% peptone, 0.1% NaNO₃, 0.05% Zn(NO₃)₂, and 0.5% olive oil (pH 6.5) (Endo, et al., "Dihydromonacolin L and Monacolin X, New Metabolites Those Inhibit Cholesterol Biosynthesis", J. Antibiot., Vol. XXXVIII No. 3, 321-327). The culture broth is extracted with a solvent such as methanol or dichloromethane, concentrated and analyzed by methods such as HPLC. By comparison with an untransformed host or a M. ruber culture containing pLO9 without the TPKS genes, the TPKS100 containing host or a derivative thereof produces increased levels of lovastatin, triol, monacolin, dihydromonacolin L or monacolin X.

EXAMPLE 28

A. Transformation of Paecilomyces viridis

P. viridis strain L-63 is grown, spheroplasted and transformed essentially according to the procedures described above. Cells are transformed with pTPKS100 or a derivative thereof. An example of such a derivative is one in which the DNA encoding the methyl transferase activity of the TPKS protein is altered such that an active methyl transferase is not produced. Petri dishes are incubated at 28° C. and 65% humidity for 7-10 days before transformed colonies are picked.

B. Fermentation of Paecilomyces

P. viridis L-63 is grown aerobically in a medium containing 7% glycerol, 3% glucose, 3% meat extract, 0.8% peptone, 0.2% NaNO₃, and 0.1% MgSO₄.7H₂ O at 25° C. for 4 to 10 days (Kimura et al., supra). The culture broth is extracted with a solvent such as methanol or dichloromethane and concentrated by evaporation if necessary. By comparison with an untransformed host or a P. viridis culture containing pLOA without the TPKS genes, the transformed host can be shown to ferment increased levels of ML-236A and compactin.

EXAMPLE 29

A. Transformation of Penicillium citrinum

A suitable culture of P. citrinum (e.g., Nara, et al., 1993. "Development of a transformation system for the filamentous, ML-236B (compactin)--producing fungus Penicillium citrinum". Curr. Genet., 23, 28-32) is transformed with pTPKS100 or an appropriate derivative thereof using conventional methods.

B. Fermentation of P. citrinum

The transformed culture is maintained on yeast-malt extract agar slant (4 g/l dextrose, 10 g/l malt extract, 4 g/l yeast extract, agar 20 g/l, pH 7 prior to sterilization). The slant is washed and used to inoculate to flasks containing KF seed medium (10 g/l CaCl₂, 5 g/l corn steep liquor, 40 g/l tomato paste, 10 g/l oatmeal, 10 g/l cerelose, 10 ml trace element per liter, pH 6.8; trace elements consist of 1 g FeSO₄.7H₂ 1 g MnSO₄.₄ H₂ O, 25 mg CuCl₂.₂ H₂ O, 100 mg CaCl₂, 56 mg H₃ BO₃, 19 mg (NH₄) 6Mo7024.H₂ O, 200 mg ZnSO₄.₇ H₂ O in liter of dH₂ O). The KF seed flasks are incubated for about 3 days at about 28° C. and 220 rpm. Approximately 1.5 ml is used to inoculate 40 ml of LM production medium per 250 ml flask. LM medium contains 20 g/l dextrose, 20 ml/l glycerol, 10 g/l ardamine pH, 20 g/l malt extract, 8 mg/l CoCls.₆ H₂ O and 0.25% polyglycol P2000, pH 7.0. After 5 to 10 days at 25° C. on a shaker, the broth is collected, extracted and concentrated. The transformed culture produces more compactin and dihydrocompactin than does the untransformed parent culture.

EXAMPLE 30

Cloning of TPKS cDNA into a Mammalian Expression Vector

TPKS cDNA expression cassettes are ligated at appropriate restriction endonuclease sites to the following vectors containing strong, universal mammalian promoters:

Cassettes containing the TPKS cDNA in the positive orientation with respect to the promoter are ligated into appropriate restriction sites 3' of the promoter and identified by restriction site mapping and/or sequencing. These cDNA expression vectors are introduced into various host cells by standard methods including but not limited to electroporation, or chemical procedures (cationic liposomes, DEAE dextran, calcium phosphate). Transfected cells and cell culture supernatants can be harvested and analyzed for TPKS expression as described below.

Vectors used for mammalian transient expression may be used to establish stable cell lines expressing TPKS.

EXAMPLE 31

Cloning of TPKS cDNA into a Baculovirus Expression Vector for Expression in Insect Cells

Baculovirus vectors, which are derived from the genome of the AcNPV virus, are designed to provide high level expression of cDNA in the Sf9 line of insect cells. Recombinant baculoviruses expressing TPKS cDNA are produced essentially by standard methods (In Vitrogen Maxbac Manual). The TPKS cDNA constructs are ligated into the polyhedrin gene in a variety of baculovirus transfer vectors including but not limited to pAC360 and the BlueBac vector (In Vitrogen). Recombinant baculoviruses are generated by homologous recombination following co-transfection of the baculovirus transfer vector and linearized AcNPV genomic DNA Kitts, P. A., Nuc. Acid. Res., 18, 5667 (1990)!into Sf9 cells. Following plaque purification, TPKS expression is measured by the assays described above.

Authentic, enzymatically-active TPKS is found in the cytoplasm of infected cells. Active TPKS is extracted from infected cells under native conditions by hypotonic or detergent lysis.

EXAMPLE 32

Cloning of TPKS cDNA into a yeast expression vector

Recombinant TPKS is produced in the yeast S. cerevisiae following the insertion of the optimal TPKS cDNA cistron into expression vectors designed to direct the intracellular or extracellular expression of heterologous proteins. In the case of intracellular expression, vectors such as EmBLyex4 or the like are ligated to the TPKS cistron Rinas, U. et al., Biotechnology, 8, 543-545 (1990); Horowitz B. et al., J. Biol. Chem. 265, 4189-4192 (1989)!. For extracellular expression, the TPKS cistron is ligated into yeast expression vectors which fuse a secretion signal (a yeast or mammnalian peptide) to the NH₂ terminus of the TPKS protein Jacobson, M. A., Gene, 85, 511-516 (1989); Riett L. and Bellon N., Biochem., 28, 2941-2949 (1989)!.

EXAMPLE 33

Use of TPKS for in vitro production of HMG-CoA inhibitors

Recombinant proteins, including complex proteins, can be overexpressed in a heterologous cells (e.g., Roberts et al., 1993, "Heterologous expression in E. coli of an intact multienzyme component of the erythromycin-producing polyketide synthase". Eur J. Biochem, 214, 305-311). If the recombinant protein is produced in an inclusion body, renaturation of the desired protein is carried out prior to enzymatic assay (Roberts, 1993).

A suitable host cell is transformed with a vector encoding the TPKS gene. The transformed host cell is grown under conditions that permit the expression of TPKS. The expressed TPKS is isolated and partially purified. The recovered active TPKS enzyme can be added to a reaction mixture containing acetyl-CoA or other charged acyl compounds, appropriate cofactors, and buffer. Incubation of the system can result in the formation of HMG-CoA reductase inhibitors.

EXAMPLE 34

Cloning of other PKS genes using TPKS gene

The cross hybridization of the DNA representing portions of the TPKS gene to genomic DNA isolated from other organisms such as M. ruber or P. citrinum, makes it possible to clone the homologous genes from the parent organisms. To do this, a genomic library of M. ruber or P. citrinum was constructed from genomic DNA according to conventional methods. Using, for example, an EMBL vector, an EMBL genomic library was prepared, plated and screened by hybridization with a ³² P-labeled DNA probe consisting of the PstI fragment from the TPKS gene. The PstI fragment contains the keto synthase sequence of the gene. Positive plaques were selected and subjected to additional screening until a purified cross-reacting plaque was selected. The DNA contained in the positive clone is further characterized by physical methods such as restriction mapping, Southern hybridization and DNA sequencing. The function of the defined gene is characterized by cloning the gene in an appropriate transformation vector and transforming a lovastatin non-producing strain with the vector. In the case of M. ruber, the cross-reacting PKS would be expected to restore production of Monacolin K (lovastatin) while introduction of a functional P. citrinum PKS would result in production of compactin.

EXAMPLE 35

Homology of A. terreus TPKS to other strains

A large segment of the 5' end of the A. terreus TPKS gene containing the keto synthase region was used to look for cross-hybridization of this region to other strains, including M. ruber, P citrinum and P. brevicompactum. The homology was examined by Southern analyses with two probes. The Southern showed cross-reaction to all three strains.

The first probe was the PstI fragment, an 800 bps probe which spans the KAS active site. This probe contains intron I 5' of the active site cysteine in addition to the entire KAS region. This probe was used to detect homology in all three strains. A. terreus displayed the profile of cross-reacting bands expected from the restriction map. M. ruber, another lovastatin-producing organism, and P. citrinum, a compactin-producing organism, showed different but strong hybridizations to the probe.

The second probe was a synthetic oligonucleotide probe having the following sequence: 5'GATACGGCATGCAGCTCGTCGTTGGTTGCCGTTCATCTGGCT GCA3' (SEQ ID NO:3). Although the hybridization signal to this probe was weaker than the hybridization to the first probe, the results confirm the observations made with the PstI fragment.

When a 3' end cDNA probe was used, cross reaction to all three strains was observed. Single cross-reacting bands in many of the digests indicate that only one gene is being detected in the genomic DNA of each strain. These data suggest that M. ruber and P. citrinum contain a gene with substantial homology to the TPKS gene of A. terreus.

EXAMPLE 36

Use of mutagenized TPKS

The DNA encoding TPKS is mutagenized using standard methods to produce an altered TPKS gene. Host cells are transformed with the altered TPKS to produce altered triol polyketides or altered polyketides with therapeutic use. The altered TPKS protein may be isolated and purified.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 3                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 11561 base pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (iii) HYPOTHETICAL: NO                                                         (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: TPKS cDNA                                                        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        CTGCAGTCAACGGATCACTTACCATTGCTGTCGCCAAAAATATCCGTGATAATCCCGCTG60                 GCTTCATTGGCAAGAGGCTTGACGTACTTGGGAGCTTGGGTCTGGAACTGGTTCATAACC120                ACCTTGGTGATGAGATGTGCATCCCTCGTGACTTCCTTGAATCCATCGAATCCGGGAAGA180                TGAGAGTGAAAGTCCTGATGAGAGCACGAAGATCAGTAAGTCAGGTCCTCACAGCGGAAG240                CAGTTGCAAAGAACGGTGGACTCCTTACCGTGCCCAAGAACTTGTACATACAGAGCTCTT300                TCATCTTGCGAAACTCATCGGCCATAGAGGAGGGAAGAATGGTGCAGTACCCAGAGTCGA360                CTATGAACCGAATGGGCTTATCATTTTGCGAGAACCAGCTCTCAATCCATGACGGTGCAT420                TCGCATCAAAATCCCGTTTGGCCCTCATGGTCGTCAGTTCCCACCATGTTTTCGGATTGA480                ACACCGGCAGATCAGATCTCCGGCCACTCGAGCACAGGTAAAGAAGAAGGCATAGTAGCC540                CCGCACTGGTAGTGACCAAGGGCGCAAACCACGAGCCATGTTGCTGCGTGTCATTCCAAG600                CCAGCGACAGAAGGTGGTGCGGCTGTGTGAGCGCGTCGACAGTCATGGCTAGGAGACCAG660                GTGTGGTTGAGGGATAAGATATCGAGAGTGATGTGAGCAAAAGATCCGGGAAAGGTCGCG720                AAGGAAAGGGCGTCTCTCTTACCAAGAAAGTCTGTTCCCTATCATGCAATCACCGCTTGC780                TGTACGGTGGTGATGATGCTGGGATGGTGGTGGGTCCCCACCGAATAACGCCGGACAGCT840                GTTGAAGCCGAATGACGCCGGCAGGCCAAAAGAACCCTACCTTCACTTACTCAATCGGCG900                CTTCCCCTCCTATCACCAAATCGGATGTAAATGGACGGGCCTTAATAGCGACCGGCCGGG960                CCGGGAATCCCCAAACGTAGATAGATAGGCATAGACCCGAAATCTTTGGCCCGGCATACA1020               TGAGCACAGGAAGTTTCACGCGACGGCGCCTTTCCTGCCTCAGCTTCAATCCAAGCTCAC1080               GAGTTCTGTCGCCTCTATCAGTCGTGCAATTGTCCTACTGCAAACAGCATGGCTCAATCT1140               ATGTATCCTAATGAGCCTATTGTCGTGGTCGGCAGTGGTTGTCGCTTCCCTGGTGACGCC1200               AACACACCCTCCAAGCTCTGGGAGCTACTCCAGCATCCTCGCGATGTGCAGAGTCGAATC1260               CCCAAAGAACGATTTGACGTCGACACATTTTATCACCCGGACGGGAAGCACCACGGGCGA1320               ACAAATGCACCCTACGCCTATGTTCTCCAAGACGATCTGGGCGCCTTCGATGCGGCCTTC1380               TTCAATATCCAGGCTGGAGAGGCCGAGAGTATGGACCCCCAGCACCGGCTGTTGCTGGAG1440               ACGGTGTACGAGGCCGTAACGAATGCTGGAATGCGTATCCAGGATCTGCAGGGAACTTCG1500               ACTGCTGTTTACGTCGGGGTGATGACGCACGACTATGAGACTGTCTCAACCCGCGACCTG1560               GAGAGCATCCCCACCTACTCGGCGACGGGTGTCGCGGTCAGTGTTGCGTCCAACCGCATC1620               TCGTATTTTTTTGACTGGCATGGACCAAGTGTAAGTCACCCAATATCGTGTAGCAGTCTA1680               ATCATGCTCTAACGGACCGGGATGGTTGAAAGATGACGATCGATACGGCATGCAGCTCGT1740               CGTTGGTTGCCGTTCATCTGGCGGTGCAACAGCTACGGACGGGTCAAAGCTCCATGGCAA1800               TTGCTGCGGGTGCGAATCTGATTCTGGGGCCCATGACATTCGTCCTTGAAAGCAAATTGA1860               GCATGCTATCCCCCTCGGGTCGATCCCGCATGTGGGACGCCGGAGCTGACGGCTATGCCA1920               GAGGCGTGAGTGTTTCTTGAGCTCGTAGATGACAGTTCCCATCGCTGACCGTGATCAGGA1980               AGCTGTTTGCTCTGTAGTGTTGAAGACATTGAGTCAAGCCTTGCGCGATGGGGACACGAT2040               TGAATGTGTCATCCGAGAAACTGGGGTGAATCAAGATGGCCGAACGACCGGAATTACGAT2100               GCCGAACCATAGTGCTCAGGAGGCACTCATCAAGGCTACCTACGCCCAGGCTGGCCTTGA2160               CATCACCAAGGCCGAGGACAGGTGCCAATTCTTCGAGGCTCATGGTCAGCAAAGAGAACC2220               TGTTCTGTTGGCGCCCTGCAGCTGACATTCGTATGATAGGGACTGGTACTCCGGCCGGAG2280               ATCCCCAGGAGGCGGAGGCCATTGCAACAGCCTTCTTCGGCCACGAGCAGGTAGCACGCA2340               GCGACGGAAACGAGAGGGCCCCTCTGTTCGTGGGCAGTGCGAAAACTGTTGTCGGGCACA2400               CCGAGGGCACGGCCGGTCTGGCTGGTCTCATGAAGGCGTCGTTCGCTGTCCGCCATGGGG2460               TAATCCCCCCCAACCTGCTGTTCGACAAAATCAGCCCGCGAGTCGCCCCATTCTATAAAA2520               ACCTGAGGATTCCGACAGAAGCTACCCAATGGCCAGCTCTCCCACCCGGACAACCGCGCC2580               GCGCCAGTGTCAACTCCTTTGGTAAGCGAGGATTGCCCGGAGGAACCCTCACAAGTACTC2640               GAATTAATGCTAACTGAACCGCGCCGATGGACAGGATTCGGCGGCACGAATGCGCATGCC2700               ATTATTGAGGAATACATGGAGCCAGAGCAAAACCAGCTGCGAGTCTCGAATAATGAGGAC2760               TGCCCACCCATGACCGGTGTCCTGAGTTTACCCTTAGTCCTCTCGGCGAAGTCCCAGCGC2820               TCCTTAAAGATAATGATGGAGGAGATGCTGCAATTCCTTCAGTCTCACCCCGAGATACAC2880               TTGCACGACCTCACCTGGTCCTTACTGCGCAAGCGGTCAGTTCTACCCTTCCGCCGGGCT2940               ATTGTCGGCCATAGTCATGAAACCATCCGCCGGGCTTTGGAGGATGCCATCGAGGATGGT3000               ATTGTGTCGAGCGACTTCACTACGGAGGTCAGAGGCCAGCCATCGGTGTTGGGAATCTTC3060               ACCGGGCAGGGGGCGCAGTGGCCGGGGATGTTAAAGAATCTGATAGAGGCATCGCCATAT3120               GTGCGGAACATAGTGAGGGAGCTGGACGACTCCCTGCAGAGCTTGCCGGAAAAATACCGG3180               CCCTCGTGGACGCTACTGGACCAGTTCATGCTAGAAGGAGAGGCCTCCAACGTCCAATAT3240               GCTACTTTCTCCCAGCCATTATGCTGCGCGGTGCAAATTGTCCTGGTCCGTCTCCTTGAA3300               GCCGCGAGAATACGATTCACGGCTGTTGTTGGACATAGCTCCGGCGAAATTGCTTGCGCC3360               TTTGCTGCCGGGCTCATCAGTGCCTCGTTGGCGATTCGGATTGCTTACTTACGTGGAGTC3420               GTCTCGGCAGGGGGCGCCAGAGGCACACCGGGAGCCATGTTGGCCGCCGGGATGTCCTTT3480               GAGGAAGCACAAGAGATCTGCGAGTTGGATGCCTTTGAGGGCCGCATCTGCGTGGCTGCC3540               AGCAATTCCCCAGACAGTGTAACTTTCTCTGGCGACGCGAACGCAATTGATCACCTGAAG3600               GGCATGTTGGAGGATGAGTCCACTTTTGCGAGACTGCTCAAGGTCGATACAGCGTACCAC3660               TCGCATCATATGCTTCCATGTGCAGACCCATATATGCAAGCCCTAGAAGAGTGTGGTTGT3720               GCTGTTGCCGATGCAGGTTCCCCAGCCGGAAGTGTACCCTGGTATTCGTCCGTGGACGCC3780               GAGAACAGGCAAATGGCAGCAAGAGACGTGACCGCCAAGTACTGGAAAGATAACTTAGTA3840               TCTCCGGTGCTATTCTCCCACGCAGTGCAGCGGGCAGTCGTCACGCACAAGGCGCTGGAT3900               ATCGGGATTGAAGTGGGCTGTCACCCAGCTCTCAAGAGCCCATGCGTCGCCACCATCAAG3960               GATGTCCTATCTGGGGTTGACCTGGCGTATACAGGTTGCTTGGAGCGAGGAAAGAATGAT4020               CTCGATTCATTCTCTCGAGCACTGGCATATCTCTGGGAAAGGTTTGGTGCCTCCAGTTTC4080               GATGCGGACGAGTTCATGCGTGCAGTCGCGCCTGATCGGCCCTGTATGAGTGTGTCGAAG4140               CTCCTACCGGCCTATCCATGGGACCGCTCTCGTCGCTACTGGGTGGAATCCCGAGCAACT4200               CGCCACCATCTTCGAGGGCCCAAGCCCCATCTTCTATTAGGAAAGCTCTCCGAATACAGC4260               ACTCCGCTAAGCTTCCAGTGGCTGAATTTTGTGCGCCCACGAGACATTGAATGGCTTGAT4320               GGACATGCATTGCAAGGCCAGACTGTCTTCCCTGCGGCCGGCTATATCGTCATGGCAATG4380               GAAGCAGCCTTAATGATTGCTGGCACCCACGCAAAGCAGGTCAAGTTACTGGAGATCTTG4440               GATATGAGCATTGACAAGGCGGTGATATTTGACGACGAAGACAGCTTGGTTGAGCTCAAC4500               CTGACAGCTGACGTGTCTCGCAACGCCGGCGAAGCAGGTTCAATGACCATAAGCTTCAAG4560               ATCGATTCCTGTCTATCGAAGGAGGGTAACCTATCCCTATCAGCCAAGGGCCAACTGGCC4620               CTAACGATAGAAGATGTCAATCCCAGGACGACTTCCGCTAGCGACCAGCACCATCTTCCC4680               CCGCCAGAAGAGGAACATCCTCATATGAACCGTGTCAACATCAATGCTTTCTACCACGAG4740               CTGGGGTTGATGGGGTACAACTACAGTAAGGACTTCCGGCGTCTCCATAACATGCAACGA4800               GCAGATCTTCGAGCCAGCGGCACCTTAGACTTCATTCCTCTGATGGACGAGGGTAATGGC4860               TGTCCTCTCCTGCTGCATCCTGCATCATTGGACGTCGCCTTCCAGACTGTCATCGGCGCA4920               TACTCCTCCCCAGGTGATCGGCGTCTACGCTGTCTGTATGTACCCACTCACGTTGATCGC4980               ATCACACTTGTCCCATCCCTTTGCCTGGCAACGGCTGAGTCCGGATGCGAGAAGGTTGCC5040               TTCAATACTATCAATACGTACGACAAGGGAGACTACTTGAGCGGTGACATTGTGGTGTTT5100               GACGCGGAGCAGACCACCCTGTTCCAGGTTGAAAATATTACTTTTAAGCCCTTTTCACCC5160               CCGGATGCTTCAACTGACCATGCGATGTTTGCCCGATGGAGCTGGGGTCCGTTGACTCCG5220               GACTCGCTGCTGGATAACCCGGAGTATTGGGCCACCGCGCAGGACAAGGAGGCGATTCCT5280               ATTATCGAACGCATCGTCTACTTCTATATCCGATCGTTCCTCAGTCAGCTTACGCTGGAG5340               GAGCGCCAGCAGGCAGCCTTCCATTTGCAGAAGCAGATCGAGTGGCTCGAACAAGTCCTG5400               GCCAGCGCCAAGGAGGGTCGTCACCTATGGTACGACCCCGGGTGGGAGAATGATACTGAG5460               GCCCAGATTGAGCACCTTTGTACTGCTAACTCCTACCACCCTCATGTTCGCCTGGTTCAG5520               CGAGTCGGCCAACACCTGCTCCCCACCGTACGATCGAACGGCAACCCATTCGACCTTCTG5580               GACCACGATGGGCTCCTGACGGAGTTCTATACCAACACACTCAGCTTCGGACCCGCACTA5640               CACTACGCCCGGGAATTGGTGGCGCAGATCGCCCATCGCTATCAGTCAATGGATATTCTG5700               GAGATTGGAGCAGGGACCGGCGGCGCTACCAAGTACGTGTTGGCCACGCCCCAGCTGGGG5760               TTCAACAGCTACACATACACCGATATCTCCACCGGATTCTTCGAGCAAGCGCGGGAGCAA5820               TTTGCCCCCTTCGAGGACCGGATGGTGTTTGAACCCCTCGATATCCGCCGCAGTCCCGCC5880               GAGCAGGGCTTCGAGCCGCATGCCTATGATCTGATCATTGCCTCCAATGTGCTACATGCG5940               ACACCCGACCTAGAGAAAACCATGGCTCACGCCCGCTCTCTGCTCAAGCCTGGAGGCCAG6000               ATGGTTATTCTGGAGATTACCCACAAAGAACACACACGGCTCGGGTTTATCTTTGGTCTG6060               TTCGCCGACTGGTGGGCTGGGGTGGATGATGGTCGCTGCACTGAGCCGTTTGTCTCGTTC6120               GACCGCTGGGATGCGATCCTAAAGCGTGTCGGGTTTTCCGGTGTGGACAGTCGCACCACG6180               GATCGGGACGCAAATCTATTCCCGACCTCTGTGTTTAGTACCCATGCAATTGACGCCACC6240               GTGGAGTACTTAGACGCGCCGCTTGCCAGCAGCGGCACCGTCAAGGACTCTTACCCTCCC6300               TTGGTGGTGGTAGGAGGGCAGACCCCCCAATCTCAGCGTCTCCTGAACGATATAAAAGCG6360               ATCATGCCTCCTCGTCCGCTCCAGACATACAAGCGCCTCGTGGATTTGCTAGACGCGGAG6420               GAGCTGCCGATGAAGTCCACGTTTGTCATGCTCACGGAGCTGGACGAGGAATTATTCGCC6480               GGGCTCACTGAAGAGACCTTCGAGGCAACCAAGCTGCTGCTCACGTACGCCAGCAATACG6540               GTCTGGCTGACAGAAAATGCCTGGGTCCAACATCCTCACCAGGCGAGCACGATCGGCATG6600               CTACGCTCCATCCGCCGGGAGCATCCTGACTTGGGAGTTCATGTTCTGGACGTCGACGCG6660               GTTGAAACCTTCGATGCAACCTTCCTGGTTGAACAGGTGCTTCGGCTTGAGGAGCATACG6720               GATGAGCTGGCCAGTTCAACTACATGGACTCAAGAACCCGAGGTCTCCTGGTGTAAAGGC6780               CGCCCGTGGATTCCTCGTCTGAAGCGCGATCTGGCTCGCAATAACCGAATGAACTCCTCG6840               CGCCGTCCCATATACGAGATGATCGATTCGTCGCGGGCTCCCGTGGCATTACAGACGGCT6900               CGGGATTCATCATCCTACTTCTTGGAGTCCGCTGAAACCTGGTTTGTGCCTGAGAGTGTT6960               CAGCAGATGGAAACAAAGACGATCTATGTCCACTTTAGCTGTCCCCATGCGCTTAGGGTC7020               GGACAGCTCGGGTTTTTCTATCTTGTGCAGGGTCACGTCCAGGAGGGCAATCGCGAAGTG7080               CCCGTCGTGGCCTTAGCAGAGCGTAACGCATCCATTGTGCACGTTCGTCCCGATTATATA7140               TATACTGAGGCAGATAACAATCTGTCTGAGGGTGGTGGCAGCCTTATGGTAACCGTCCTC7200               GCCGCGGCGGTGTTGGCGGAGACGGTGATCAGTACCGCCAAGTGCCTGGGGGTAACTGAC7260               TCAATCCTCGTTCTGAATCCCCCCAGCATATGTGGGCAGATGTTGCTCCATGCTGGTGAA7320               GAGATCGGTCTTCAAGTTCATCTGGCCACCACTTCTGGCAACAGGAGTTCGGTTTCTGCT7380               GGAGACGCCAAGTCCTGGCTAACATTGCATGCTCGCGACACGGACTGGCACCTGCGACGG7440               GTACTGCCCCGGGGTGTCCAGGCTTTAGTCGACTTATCAGCCGACCAGAGCTGTGAAGGT7500               TTGACTCAGAGGATGATGAAAGTTCTGATGCCTGGCTGTGCCCATTACCGTGCGGCAGAC7560               CTGTTCACAGACACCGTTTCCACTGAATTGCATAGCGGATCGCGGCATCAAGCTTCACTG7620               CCCGCCGCATATTGGGAGCATGTGGTATCCTTAGCCCGCCAGGGACTTCCTAGTGTCAGC7680               GAGGGGTGGGAGGTGATGCCGTGCACTCAATTTGCAGCGCATGCCGACAAGACGCGCCCG7740               GATCTCTCGACAGTTATTTCCTGGCCCCGGGAGTCGGACGAGGCTACGCTTCCTACCAGG7800               GTTCGCTCCATTGACGCTGAGACCCTCTTTGCGGCCGACAAAACATATCTCCTGGTCGGA7860               CTGACTGGAGATCTTGGACGATCACTAGGTCGTTGGATGGTCCAGCATGGGGCCTGCCAC7920               ATTGTACTTACGAGCAGAAATCCGCAGGTGAACCCCAAGTGGCTGGCGCATGTTGAAGAA7980               CTGGGTGGTCGAGTCACTGTTCTTTCCATGTAAGAGGAGTCCTTCCTTCTGCAATTCCTC8040               CTTATGATCCCGACTAACGCAGCTGGCTTCAGGGACGTGACAAGCCAAAACTCAGTGGAA8100               GCTGGCCTGGCTAAACTCAAGGATCTGCATCTGCCACCAGTGGGGGGTATTGCCTTTGGC8160               CCTCTGGTTCTGCAGGATGTGATGCTAAATAATATGGAACTGCCAATGATGGAGATGGTG8220               CTCAACCCCAAGGTCGAAGGCGTCCGCATCCTGCACGAGAAGTTCTCCGATCCGACCAGT8280               AGCAACCCTCTCGACTTCTTCGTGATGTTCTCCTCGATTGTGGCCGTCATGGGCAACCCG8340               GGTCAGGCTAACTACAGTGCGGCTAACTGCTACCTTCAAGCGCTGGCGCAGCAGCGAGTT8400               GCATCCGGATTAGCAGTACGTTTTCACTCCATCCTTTGCTAAACACTCCTATGGGCCTTT8460               ACTAAACCGGGCAGGCGTCCACCATCGACATCGGTGCCGTGTACGGCGTTGGGTTCGTCA8520               CTCGGGCGGAGCTGGAGGAGGACTTTAATGCAATTCGGTTCATGTTCGATTCGGTTGAGG8580               AACATGAACTGCATACACTGTTTGCTGAGGCAGTGGTGGCCGGTCGACGAGCCGTGCACC8640               AGCAAGAGCAGCAGCGGAAGTTCGCGACAGTGCTCGACATGGCTGATCTGGAACTGACAA8700               CCGGAATTCCGCCCCTGGATCCAGCCCTCAAAGATCGGATCACCTTCTTCGACGACCCCC8760               GCATAGGCAACTTAAAAATTCCGGAGTACCGAGGGGCCAAAGCAGGCGAAGGGGCAGCCG8820               GCTCCAAGGGCTCGGTCAAAGAACAGCTCTTGCAGGCGACGAACCTGGACCAGGTCCGTC8880               AGATCGTCATCGGTAAGTTGAGCGAATCCGGGGAATATTCTCCCCTTCCTCACTCAGCGG8940               ACTGGAGATTAACCGCTTCTTTTCCTTTGGCAGATGGACTCTCCGCGAAGCTGCAGGTGA9000               CCCTGCAGATCCCCGATGGGGAAAGCGTGCATCCCACCATCCCACTAATCGATCAGGGGG9060               TGGACTCTCTGGGCGCGGTCACCGTGGGAACCTGGTTCTCCAAGCAGCTGTACCTTGATT9120               TGCCACTCCTGAAAGTGCTTGGGGGTGCTTCGATCACCGATCTCGCTAATGAGGCTGCTG9180               CGCGATTGCCACCTAGCTCCATTCCCCTCGTCGCAGCCACCGACGGGGGTGCAGAGAGCA9240               CTGACAATACTTCCGAGAATGAAGTTTCGGGACGCGAGGATACTGACCTTAGTGCCGCCG9300               CCACCATCACTGAGCCCTCGTCTGCCGACGAAGACGATACGGAGCCGGGCGACGAGGACG9360               TCCCGCGTTCCCACCATCCACTGTCTCTCGGGCAAGAATACTCCTGGAGAATCCAGCAGG9420               GAGCCGAAGACCCCACCGTCTTTAACAACACCATTGGTATGTTCATGAAGGGCTCTATTG9480               ACCTTAAACGGCTGTACAAGGCGTTGAGAGCGGTCTTGCGCCGCCACGAGATCTTCCGCA9540               CGGGGTTTGCCAACGTGGATGAGAACGGGATGGCCCAGCTGGTGTTTGGTCAAACCAAAA9600               ACAAAGTCCAGACCATCCAAGTGTCTGACCGAGCCGGCGCCGAAGAGGGCTACCGACAAC9660               TGGTGCAGACACGGTATAACCCTGCCGCAGGAGACACCTTGCGGCTGGTGGACTTCTTCT9720               GGGGCCAGGACGACCATCTGCTGGTTGTGGCTTACCACCGACTCGTCGGGGATGGATCTA9780               CTACAGAGAACATCTTCGTCGAAGCGGGCCAGCTCTACGACGGCACGTCGCTAAGTCCAC9840               ATGTCCCTCAGTTTGCGGACCTGGCGGCACGGCAACGCGCAATGCTCGAGGATGGGAGAA9900               TGGAGGAGGATCTCGCGTACTGGAAGAAAATGCATTACCGACCGTCCTCAATTCCAGTGC9960               TCCCACTGATGCGGCCCCTGGTAGGTAACAGTAGCAGGTCCGATACTCCAAATTTCCAGC10020              ACTGTGGACCCTGGCAGCAGCACGAAGCCGTGGCGCGACTTGATCCGATGGTGGCCTTCC10080              GCATCAAGGAGCGCAGTCGCAAGCACAAGGCGACGCCGATGCAGTTCTATCTGGCGGCGT10140              ATCAGGTGCTGTTGGCGCGCCTCACCGACAGCACCGATCTCACCGTGGGCCTCGCCGACA10200              CCAACCGTGCGACTGTCGACGAGATGGCGGCCATGGGGTTCTTCGCCAACCTCCTTCCCC10260              TGCGCTTCCGGGATTTCCGCCCCCATATAACGTTTGGCGAGCACCTTATCGCCACCCGTG10320              ACCTGGTGCGTGAGGCCTTGCAGCACGCCCGCGTGCCCTACGGCGTCCTCCTCGATCAAC10380              TGGGGCTGGAGGTCCCGGTCCCGACCAGCAATCAACCTGCGCCTTTGTTCCAGGCCGTCT10440              TCGATTACAAGCAGGGCCAGGCGGAAAGTGGAACGATTGGGGGTGCCAAGATAACCGAGG10500              TGATTGCCACGCGCGAGCGCACCCCTTACGATGTCGTGCTGGAGATGTCGGATGATCCCA10560              CCAAGGATCCGCTGCTCACGGCCAAGTTACAGAGTTCCCGCTACGAGGCTCACCACCCTC10620              AAGCCTTCTTGGAGAGCTACATGTCCCTTCTCTCTATGTTCTCGATGAATCCCGCCCTGA10680              AGCTGGCATGATGGCGCAAACATAGAACATGATAGCGCAGCAGGGACGATGTAGATAGAG10740              CTTTGCTTCTGCGGGTGGATCTATAATATAGTATATATAAATATGGTGAGCCGAACGAAG10800              AGGGGGGAATGCCACAATTATTTACTGTTTTGCGCCGTACACGAGGAGAAGACGTCCAGA10860              ACAACATAAATATATCACTCTAGTGAGACACCATATATTCGGAGAGACTATAAAAATATA10920              CATCTACTCCAATGTCTGGGCCGTCACACACAGCTTACGAAAACGATTAATGACCTCCAA10980              CACGTCGCGCGGTCGATTGGGAAACTGATGCTGCCCAGCAAACTCCAATACCTGCGCCTC11040              TCGGGGGGAGAAATGGCGCGCCACCAGCATCTTCGATCCTGCGAGCGCAAAATCATCGCG11100              ACCCTGCAGATGTAATGTCGGTATCCGAATGACCAGTTCCTCCTGCCACTCGGTATCTTT11160              GCTGTCGTTGTCGTCGTCATGGTTCTTCATCATTCGTTCCTCATATACTGGCTTGCCTCG11220              TCTTGATACCAGGGACAGATCAACAGCGCAACACTCATCCGGGGCAACCAGGGCAGGTGA11280              CCCATCTGCTGCTGCCAGAGGAGCAAGGTCGTCACCAGGGCACCTTCGGAGAAACCGATA11340              GCACCCACGATAGGGATGTGGGGGTGTTGAGTCTGCCAGTCGACAATGGTGCGGCGGATG11400              GGGTCGTGGACGGCGGCGAGGCGTTCGCTCACGGAGGGTCCATTATGATTGTTGTCGCTG11460              CTGCTTTCAAACCAGGAGTAATATGGCCCTAGGTCGGCGAAGACGGGGAGAATCCCAGGC11520              CCTGCAGAGGAAGGGAACGGAGCTGTCACGTAGACGAATTC11561                                 (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3038 amino acids                                                   (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL: YES                                                        (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: TPKS Protein                                                     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetAlaGlnSerMetTyrProAsnGluProIleValValValGlySer                               151015                                                                         GlyCysArgPheProGlyAspAlaAsnThrProSerLysLeuTrpGlu                               202530                                                                         LeuLeuGlnHisProArgAspValGlnSerArgIleProLysGluArg                               354045                                                                         PheAspValAspThrPheTyrHisProAspGlyLysHisHisGlyArg                               505560                                                                         ThrAsnAlaProTyrAlaTyrValLeuGlnAspAspLeuGlyAlaPhe                               65707580                                                                       AspAlaAlaPhePheAsnIleGlnAlaGlyGluAlaGluSerMetAsp                               859095                                                                         ProGlnHisArgLeuLeuLeuGluThrValTyrGluAlaValThrAsn                               100105110                                                                      AlaGlyMetArgIleGlnAspLeuGlnGlyThrSerThrAlaValTyr                               115120125                                                                      ValGlyValMetThrHisAspTyrGluThrValSerThrArgAspLeu                               130135140                                                                      GluSerIleProThrTyrSerAlaThrGlyValAlaValSerValAla                               145150155160                                                                   SerAsnArgIleSerTyrPhePheAspTrpHisGlyProSerMetThr                               165170175                                                                      IleAspThrAlaCysSerSerSerLeuValAlaValHisLeuAlaVal                               180185190                                                                      GlnGlnLeuArgThrGlyGlnSerSerMetAlaIleAlaAlaGlyAla                               195200205                                                                      AsnLeuIleLeuGlyProMetThrPheValLeuGluSerLysLeuSer                               210215220                                                                      MetLeuSerProSerGlyArgSerArgMetTrpAspAlaGlyAlaAsp                               225230235240                                                                   GlyTyrAlaArgGlyGluAlaValCysSerValValLeuLysThrLeu                               245250255                                                                      SerGlnAlaLeuArgAspGlyAspThrIleGluCysValIleArgGlu                               260265270                                                                      ThrGlyValAsnGlnAspGlyArgThrThrGlyIleThrMetProAsn                               275280285                                                                      HisSerAlaGlnGluAlaLeuIleLysAlaThrTyrAlaGlnAlaGly                               290295300                                                                      LeuAspIleThrLysAlaGluAspArgCysGlnPhePheGluAlaHis                               305310315320                                                                   GlyThrGlyThrProAlaGlyAspProGlnGluAlaGluAlaIleAla                               325330335                                                                      ThrAlaPhePheGlyHisGluGlnValAlaArgSerAspGlyAsnGlu                               340345350                                                                      ArgAlaProLeuPheValGlySerAlaLysThrValValGlyHisThr                               355360365                                                                      GluGlyThrAlaGlyLeuAlaGlyLeuMetLysAlaSerPheAlaVal                               370375380                                                                      ArgHisGlyValIleProProAsnLeuLeuPheAspLysIleSerPro                               385390395400                                                                   ArgValAlaProPheTyrLysAsnLeuArgIleProThrGluAlaThr                               405410415                                                                      GlnTrpProAlaLeuProProGlyGlnProArgArgAlaSerValAsn                               420425430                                                                      SerPheGlyPheGlyGlyThrAsnAlaHisAlaIleIleGluGluTyr                               435440445                                                                      MetGluProGluGlnAsnGlnLeuArgValSerAsnAsnGluAspCys                               450455460                                                                      ProProMetThrGlyValLeuSerLeuProLeuValLeuSerAlaLys                               465470475480                                                                   SerGlnArgSerLeuLysIleMetMetGluGluMetLeuGlnPheLeu                               485490495                                                                      GlnSerHisProGluIleHisLeuHisAspLeuThrTrpSerLeuLeu                               500505510                                                                      ArgLysArgSerValLeuProPheArgArgAlaIleValGlyHisSer                               515520525                                                                      HisGluThrIleArgArgAlaLeuGluAspAlaIleGluAspGlyIle                               530535540                                                                      ValSerSerAspPheThrThrGluValArgGlyGlnProSerValLeu                               545550555560                                                                   GlyIlePheThrGlyGlnGlyAlaGlnTrpProGlyMetLeuLysAsn                               565570575                                                                      LeuIleGluAlaSerProTyrValArgAsnIleValArgGluLeuAsp                               580585590                                                                      AspSerLeuGlnSerLeuProGluLysTyrArgProSerTrpThrLeu                               595600605                                                                      LeuAspGlnPheMetLeuGluGlyGluAlaSerAsnValGlnTyrAla                               610615620                                                                      ThrPheSerGlnProLeuCysCysAlaValGlnIleValLeuValArg                               625630635640                                                                   LeuLeuGluAlaAlaArgIleArgPheThrAlaValValGlyHisSer                               645650655                                                                      SerGlyGluIleAlaCysAlaPheAlaAlaGlyLeuIleSerAlaSer                               660665670                                                                      LeuAlaIleArgIleAlaTyrLeuArgGlyValValSerAlaGlyGly                               675680685                                                                      AlaArgGlyThrProGlyAlaMetLeuAlaAlaGlyMetSerPheGlu                               690695700                                                                      GluAlaGlnGluIleCysGluLeuAspAlaPheGluGlyArgIleCys                               705710715720                                                                   ValAlaAlaSerAsnSerProAspSerValThrPheSerGlyAspAla                               725730735                                                                      AsnAlaIleAspHisLeuLysGlyMetLeuGluAspGluSerThrPhe                               740745750                                                                      AlaArgLeuLeuLysValAspThrAlaTyrHisSerHisHisMetLeu                               755760765                                                                      ProCysAlaAspProTyrMetGlnAlaLeuGluGluCysGlyCysAla                               770775780                                                                      ValAlaAspAlaGlySerProAlaGlySerValProTrpTyrSerSer                               785790795800                                                                   ValAspAlaGluAsnArgGlnMetAlaAlaArgAspValThrAlaLys                               805810815                                                                      TyrTrpLysAspAsnLeuValSerProValLeuPheSerHisAlaVal                               820825830                                                                      GlnArgAlaValValThrHisLysAlaLeuAspIleGlyIleGluVal                               835840845                                                                      GlyCysHisProAlaLeuLysSerProCysValAlaThrIleLysAsp                               850855860                                                                      ValLeuSerGlyValAspLeuAlaTyrThrGlyCysLeuGluArgGly                               865870875880                                                                   LysAsnAspLeuAspSerPheSerArgAlaLeuAlaTyrLeuTrpGlu                               885890895                                                                      ArgPheGlyAlaSerSerPheAspAlaAspGluPheMetArgAlaVal                               900905910                                                                      AlaProAspArgProCysMetSerValSerLysLeuLeuProAlaTyr                               915920925                                                                      ProTrpAspArgSerArgArgTyrTrpValGluSerArgAlaThrArg                               930935940                                                                      HisHisLeuArgGlyProLysProHisLeuLeuLeuGlyLysLeuSer                               945950955960                                                                   GluTyrSerThrProLeuSerPheGlnTrpLeuAsnPheValArgPro                               965970975                                                                      ArgAspIleGluTrpLeuAspGlyHisAlaLeuGlnGlyGlnThrVal                               980985990                                                                      PheProAlaAlaGlyTyrIleValMetAlaMetGluAlaAlaLeuMet                               99510001005                                                                    IleAlaGlyThrHisAlaLysGlnValLysLeuLeuGluIleLeuAsp                               101010151020                                                                   MetSerIleAspLysAlaValIlePheAspAspGluAspSerLeuVal                               1025103010351040                                                               GluLeuAsnLeuThrAlaAspValSerArgAsnAlaGlyGluAlaGly                               104510501055                                                                   SerMetThrIleSerPheLysIleAspSerCysLeuSerLysGluGly                               106010651070                                                                   AsnLeuSerLeuSerAlaLysGlyGlnLeuAlaLeuThrIleGluAsp                               107510801085                                                                   ValAsnProArgThrThrSerAlaSerAspGlnHisHisLeuProPro                               109010951100                                                                   ProGluGluGluHisProHisMetAsnArgValAsnIleAsnAlaPhe                               1105111011151120                                                               TyrHisGluLeuGlyLeuMetGlyTyrAsnTyrSerLysAspPheArg                               112511301135                                                                   ArgLeuHisAsnMetGlnArgAlaAspLeuArgAlaSerGlyThrLeu                               114011451150                                                                   AspPheIleProLeuMetAspGluGlyAsnGlyCysProLeuLeuLeu                               115511601165                                                                   HisProAlaSerLeuAspValAlaPheGlnThrValIleGlyAlaTyr                               117011751180                                                                   SerSerProGlyAspArgArgLeuArgCysLeuTyrValProThrHis                               1185119011951200                                                               ValAspArgIleThrLeuValProSerLeuCysLeuAlaThrAlaGlu                               120512101215                                                                   SerGlyCysGluLysValAlaPheAsnThrIleAsnThrTyrAspLys                               122012251230                                                                   GlyAspTyrLeuSerGlyAspIleValValPheAspAlaGluGlnThr                               123512401245                                                                   ThrLeuPheGlnValGluAsnIleThrPheLysProPheSerProPro                               125012551260                                                                   AspAlaSerThrAspHisAlaMetPheAlaArgTrpSerTrpGlyPro                               1265127012751280                                                               LeuThrProAspSerLeuLeuAspAsnProGluTyrTrpAlaThrAla                               128512901295                                                                   GlnAspLysGluAlaIleProIleIleGluArgIleValTyrPheTyr                               130013051310                                                                   IleArgSerPheLeuSerGlnLeuThrLeuGluGluArgGlnGlnAla                               131513201325                                                                   AlaPheHisLeuGlnLysGlnIleGluTrpLeuGluGlnValLeuAla                               133013351340                                                                   SerAlaLysGluGlyArgHisLeuTrpTyrAspProGlyTrpGluAsn                               1345135013551360                                                               AspThrGluAlaGlnIleGluHisLeuCysThrAlaAsnSerTyrHis                               136513701375                                                                   ProHisValArgLeuValGlnArgValGlyGlnHisLeuLeuProThr                               138013851390                                                                   ValArgSerAsnGlyAsnProPheAspLeuLeuAspHisAspGlyLeu                               139514001405                                                                   LeuThrGluPheTyrThrAsnThrLeuSerPheGlyProAlaLeuHis                               141014151420                                                                   TyrAlaArgGluLeuValAlaGlnIleAlaHisArgTyrGlnSerMet                               1425143014351440                                                               AspIleLeuGluIleGlyAlaGlyThrGlyGlyAlaThrLysTyrVal                               144514501455                                                                   LeuAlaThrProGlnLeuGlyPheAsnSerTyrThrTyrThrAspIle                               146014651470                                                                   SerThrGlyPhePheGluGlnAlaArgGluGlnPheAlaProPheGlu                               147514801485                                                                   AspArgMetValPheGluProLeuAspIleArgArgSerProAlaGlu                               149014951500                                                                   GlnGlyPheGluProHisAlaTyrAspLeuIleIleAlaSerAsnVal                               1505151015151520                                                               LeuHisAlaThrProAspLeuGluLysThrMetAlaHisAlaArgSer                               152515301535                                                                   LeuLeuLysProGlyGlyGlnMetValIleLeuGluIleThrHisLys                               154015451550                                                                   GluHisThrArgLeuGlyPheIlePheGlyLeuPheAlaAspTrpTrp                               155515601565                                                                   AlaGlyValAspAspGlyArgCysThrGluProPheValSerPheAsp                               157015751580                                                                   ArgTrpAspAlaIleLeuLysArgValGlyPheSerGlyValAspSer                               1585159015951600                                                               ArgThrThrAspArgAspAlaAsnLeuPheProThrSerValPheSer                               160516101615                                                                   ThrHisAlaIleAspAlaThrValGluTyrLeuAspAlaProLeuAla                               162016251630                                                                   SerSerGlyThrValLysAspSerTyrProProLeuValValValGly                               163516401645                                                                   GlyGlnThrProGlnSerGlnArgLeuLeuAsnAspIleLysAlaIle                               165016551660                                                                   MetProProArgProLeuGlnThrTyrLysArgLeuValAspLeuLeu                               1665167016751680                                                               AspAlaGluGluLeuProMetLysSerThrPheValMetLeuThrGlu                               168516901695                                                                   LeuAspGluGluLeuPheAlaGlyLeuThrGluGluThrPheGluAla                               170017051710                                                                   ThrLysLeuLeuLeuThrTyrAlaSerAsnThrValTrpLeuThrGlu                               171517201725                                                                   AsnAlaTrpValGlnHisProHisGlnAlaSerThrIleGlyMetLeu                               173017351740                                                                   ArgSerIleArgArgGluHisProAspLeuGlyValHisValLeuAsp                               1745175017551760                                                               ValAspAlaValGluThrPheAspAlaThrPheLeuValGluGlnVal                               176517701775                                                                   LeuArgLeuGluGluHisThrAspGluLeuAlaSerSerThrThrTrp                               178017851790                                                                   ThrGlnGluProGluValSerTrpCysLysGlyArgProTrpIlePro                               179518001805                                                                   ArgLeuLysArgAspLeuAlaArgAsnAsnArgMetAsnSerSerArg                               181018151820                                                                   ArgProIleTyrGluMetIleAspSerSerArgAlaProValAlaLeu                               1825183018351840                                                               GlnThrAlaArgAspSerSerSerTyrPheLeuGluSerAlaGluThr                               184518501855                                                                   TrpPheValProGluSerValGlnGlnMetGluThrLysThrIleTyr                               186018651870                                                                   ValHisPheSerCysProHisAlaLeuArgValGlyGlnLeuGlyPhe                               187518801885                                                                   PheTyrLeuValGlnGlyHisValGlnGluGlyAsnArgGluValPro                               189018951900                                                                   ValValAlaLeuAlaGluArgAsnAlaSerIleValHisValArgPro                               1905191019151920                                                               AspTyrIleTyrThrGluAlaAspAsnAsnLeuSerGluGlyGlyGly                               192519301935                                                                   SerLeuMetValThrValLeuAlaAlaAlaValLeuAlaGluThrVal                               194019451950                                                                   IleSerThrAlaLysCysLeuGlyValThrAspSerIleLeuValLeu                               195519601965                                                                   AsnProProSerIleCysGlyGlnMetLeuLeuHisAlaGlyGluGlu                               197019751980                                                                   IleGlyLeuGlnValHisLeuAlaThrThrSerGlyAsnArgSerSer                               1985199019952000                                                               ValSerAlaGlyAspAlaLysSerTrpLeuThrLeuHisAlaArgAsp                               200520102015                                                                   ThrAspTrpHisLeuArgArgValLeuProArgGlyValGlnAlaLeu                               202020252030                                                                   ValAspLeuSerAlaAspGlnSerCysGluGlyLeuThrGlnArgMet                               203520402045                                                                   MetLysValLeuMetProGlyCysAlaHisTyrArgAlaAlaAspLeu                               205020552060                                                                   PheThrAspThrValSerThrGluLeuHisSerGlySerArgHisGln                               2065207020752080                                                               AlaSerLeuProAlaAlaTyrTrpGluHisValValSerLeuAlaArg                               208520902095                                                                   GlnGlyLeuProSerValSerGluGlyTrpGluValMetProCysThr                               210021052110                                                                   GlnPheAlaAlaHisAlaAspLysThrArgProAspLeuSerThrVal                               211521202125                                                                   IleSerTrpProArgGluSerAspGluAlaThrLeuProThrArgVal                               213021352140                                                                   ArgSerIleAspAlaGluThrLeuPheAlaAlaAspLysThrTyrLeu                               2145215021552160                                                               LeuValGlyLeuThrGlyAspLeuGlyArgSerLeuGlyArgTrpMet                               216521702175                                                                   ValGlnHisGlyAlaCysHisIleValLeuThrSerArgAsnProGln                               218021852190                                                                   ValAsnProLysTrpLeuAlaHisValGluGluLeuGlyGlyArgVal                               219522002205                                                                   ThrValLeuSerMetAspValThrSerGlnAsnSerValGluAlaGly                               221022152220                                                                   LeuAlaLysLeuLysAspLeuHisLeuProProValGlyGlyIleAla                               2225223022352240                                                               PheGlyProLeuValLeuGlnAspValMetLeuAsnAsnMetGluLeu                               224522502255                                                                   ProMetMetGluMetValLeuAsnProLysValGluGlyValArgIle                               226022652270                                                                   LeuHisGluLysPheSerAspProThrSerSerAsnProLeuAspPhe                               227522802285                                                                   PheValMetPheSerSerIleValAlaValMetGlyAsnProGlyGln                               229022952300                                                                   AlaAsnTyrSerAlaAlaAsnCysTyrLeuGlnAlaLeuAlaGlnGln                               2305231023152320                                                               ArgValAlaSerGlyLeuAlaAlaSerThrIleAspIleGlyAlaVal                               232523302335                                                                   TyrGlyValGlyPheValThrArgAlaGluLeuGluGluAspPheAsn                               234023452350                                                                   AlaIleArgPheMetPheAspSerValGluGluHisGluLeuHisThr                               235523602365                                                                   LeuPheAlaGluAlaValValAlaGlyArgArgAlaValHisGlnGln                               237023752380                                                                   GluGlnGlnArgLysPheAlaThrValLeuAspMetAlaAspLeuGlu                               2385239023952400                                                               LeuThrThrGlyIleProProLeuAspProAlaLeuLysAspArgIle                               240524102415                                                                   ThrPhePheAspAspProArgIleGlyAsnLeuLysIleProGluTyr                               242024252430                                                                   ArgGlyAlaLysAlaGlyGluGlyAlaAlaGlySerLysGlySerVal                               243524402445                                                                   LysGluGlnLeuLeuGlnAlaThrAsnLeuAspGlnValArgGlnIle                               245024552460                                                                   ValIleAspGlyLeuSerAlaLysLeuGlnValThrLeuGlnIlePro                               2465247024752480                                                               AspGlyGluSerValHisProThrIleProLeuIleAspGlnGlyVal                               248524902495                                                                   AspSerLeuGlyAlaValThrValGlyThrTrpPheSerLysGlnLeu                               250025052510                                                                   TyrLeuAspLeuProLeuLeuLysValLeuGlyGlyAlaSerIleThr                               251525202525                                                                   AspLeuAlaAsnGluAlaAlaAlaArgLeuProProSerSerIlePro                               253025352540                                                                   LeuValAlaAlaThrAspGlyGlyAlaGluSerThrAspAsnThrSer                               2545255025552560                                                               GluAsnGluValSerGlyArgGluAspThrAspLeuSerAlaAlaAla                               256525702575                                                                   ThrIleThrGluProSerSerAlaAspGluAspAspThrGluProGly                               258025852590                                                                   AspGluAspValProArgSerHisHisProLeuSerLeuGlyGlnGlu                               259526002605                                                                   TyrSerTrpArgIleGlnGlnGlyAlaGluAspProThrValPheAsn                               261026152620                                                                   AsnThrIleGlyMetPheMetLysGlySerIleAspLeuLysArgLeu                               2625263026352640                                                               TyrLysAlaLeuArgAlaValLeuArgArgHisGluIlePheArgThr                               264526502655                                                                   GlyPheAlaAsnValAspGluAsnGlyMetAlaGlnLeuValPheGly                               266026652670                                                                   GlnThrLysAsnLysValGlnThrIleGlnValSerAspArgAlaGly                               267526802685                                                                   AlaGluGluGlyTyrArgGlnLeuValGlnThrArgTyrAsnProAla                               269026952700                                                                   AlaGlyAspThrLeuArgLeuValAspPhePheTrpGlyGlnAspAsp                               2705271027152720                                                               HisLeuLeuValValAlaTyrHisArgLeuValGlyAspGlySerThr                               272527302735                                                                   ThrGluAsnIlePheValGluAlaGlyGlnLeuTyrAspGlyThrSer                               274027452750                                                                   LeuSerProHisValProGlnPheAlaAspLeuAlaAlaArgGlnArg                               275527602765                                                                   AlaMetLeuGluAspGlyArgMetGluGluAspLeuAlaTyrTrpLys                               277027752780                                                                   LysMetHisTyrArgProSerSerIleProValLeuProLeuMetArg                               2785279027952800                                                               ProLeuValGlyAsnSerSerArgSerAspThrProAsnPheGlnHis                               280528102815                                                                   CysGlyProTrpGlnGlnHisGluAlaValAlaArgLeuAspProMet                               282028252830                                                                   ValAlaPheArgIleLysGluArgSerArgLysHisLysAlaThrPro                               283528402845                                                                   MetGlnPheTyrLeuAlaAlaTyrGlnValLeuLeuAlaArgLeuThr                               285028552860                                                                   AspSerThrAspLeuThrValGlyLeuAlaAspThrAsnArgAlaThr                               2865287028752880                                                               ValAspGluMetAlaAlaMetGlyPhePheAlaAsnLeuLeuProLeu                               288528902895                                                                   ArgPheArgAspPheArgProHisIleThrPheGlyGluHisLeuIle                               290029052910                                                                   AlaThrArgAspLeuValArgGluAlaLeuGlnHisAlaArgValPro                               291529202925                                                                   TyrGlyValLeuLeuAspGlnLeuGlyLeuGluValProValProThr                               293029352940                                                                   SerAsnGlnProAlaProLeuPheGlnAlaValPheAspTyrLysGln                               2945295029552960                                                               GlyGlnAlaGluSerGlyThrIleGlyGlyAlaLysIleThrGluVal                               296529702975                                                                   IleAlaThrArgGluArgThrProTyrAspValValLeuGluMetSer                               298029852990                                                                   AspAspProThrLysAspProLeuLeuThrAlaLysLeuGlnSerSer                               299530003005                                                                   ArgTyrGluAlaHisHisProGlnAlaPheLeuGluSerTyrMetSer                               301030153020                                                                   LeuLeuSerMetPheSerMetAsnProAlaLeuLysLeuAla                                     302530303035                                                                   (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 45 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: probe                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        GATACGGCATGCAGCTCGTCGTTGGTTGCCGTTCATCTGGCTGCA45                                __________________________________________________________________________ 

What is claimed is:
 1. A purified and isolated DNA molecule encoding a triol polyketide synthase of Aspergillus terreus said DNA molecule having a nucleotide sequence set forth in SEQ ID NO:1.
 2. An expression vector comprising a DNA molecule of claim
 1. 3. A host cell transformed with an expression vector of claim
 2. 4. The expression vector of claim 2 which is pTPKS100 (ATCC 69416).
 5. A host cell transformed with the expression vector of claim
 4. 6. A process for producing HMG-CoA reductase inhibitors, comprising:(a) transforming a cell with a DNA molecule of claim 1; (b) cultivating the transformed cell under conditions that permit the expression of the DNA molecule; and (c) recovering the HMG-CoA reductase inhibitor.
 7. The process of claim 6 wherein the HMG-CoA reductase inhibitors are selected from the group consisting of lovastatin, triol and compactin.
 8. The process of claim 6 wherein said transformed cell is selected from the group consisting of cells of Aspergillus terreus, Monascus ruber, Penicillum citrinum, Penicillum brevicompactum, Hypomyces chrysospermus, Paecilomyces viridis, Paecilomyces sp. M2016, Eupenicillium sp. MM603, Trichoderma longibrachiatum M6735 and Trichoderma pseudokoningii M6828.
 9. A method of isolating DNA encoding polyketide synthase, comprising:(a) hybridizing a DNA of claim 1 to a sample containing DNA encoding polyketide synthase to form a complex; and (b) purifying the complex.
 10. The method of claim 9 wherein the sample is derived from a microorganism, the microorganism being selected from the group consisting of Aspergillus terreus, Monascus ruber, Penicillum citrinum, Penicillum brevicompactum, Hypomyces chrysopermus, Paecilomyces viridis, Paecilomyces sp. M2016, Eupenicillium sp. MM603, Trichoderma longibrachiatum. M6735 and Trichoderma pseudokoningii M6828. 